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ABSTRACT 



A signal processing system which discriminates between 
voice signals and data signals modulated by a voiceband 
carrier. The signal processing system includes a voice 
exchange, a data exchange and a call discriminator. The 
voice exchange is capable of exchanging voice signals 
between a circuit switched network and a packet based 
network. The signal processing system also includes a data 
exchange capable of exchanging data signals modulated by 
a voiceband carrier on the circuit switched network with 
unmodulated data signal packets on the packet based net- 
work. The data exchange is performed by demodulating data 
signals from the circuit switched network for transmission 
on the packet based network, and re-modulating data signal 
packets from the packet based network for transmission on 
the circuit switched network. The call discriminator is used 
to selectively enable the voice exchange and data exchange. 
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PACKET BASED NETWORK EXCHANGE 
WITH RATE SYNCHRONIZATION 

CROSS REFERENCE TO RELATED 
APPLICATIONS 

The present application is a continuation-in-part of 
co-pending application Sen No. 09/454,219, filed Dec. 9, 
1999, which claims priority to provisional Application No. 
60/154,903, filed Sep. 20, 1999; Application No. 60/156, 
266, filed Sep. 27, 1999; Application No. 60/157,470, filed 
Oct. 1, 1999; Application No. 60/160,124, filed Oct 18, 
1999; Application No. 60/161,152, filed Oct. 22, 1999; 
Application No. 60/162,315, filed Oct. 28, 1999; Applica- 
tion No. 60/163,169, filed Nov. 2, 1999; Application No. 
60/163,170, filed Nov. 2, 1999; Application No. 60/163,600; 
filed Nov. 4, 1999; Application No. 60/164,379, filed Nov. 9, 
1999; Application No. 60/164,690, filed Nov. 10, 1999; 
Application No. 60/164,689, filed Nov, 10, 1999; Applica- 
tion No. 60/166,289, filed Nov. 18, 1999, the priority of 
which is hereby claimed under 35 U.S.C. §§119(e), 120. 

The present application also claims priority under 35 
U.S.C. §119(e) to provisional Application No. 60/171,203, 
filed Dec. 15, 1999; Application No. 60/171,180, filed Dec. 
16, 1999; Application No. 60/171,169, filed Dec. 16, 1999; 
Application No. 60/171,184, filed Dec. 16, 1999, and Appli- 
cation No. 60/178,258, filed Jan. 25, 2000. 

All these applications are expressly incorporated herein 
by reference as though fully set forth in full. 

This application contains subject matter that is related to 
co-pending patent application Ser. No. 09/493,458, filed Jan. 
28, 2000; co-pending patent application Ser. No. 09/643, 
920, filed Aug. 23, 2000; co -pending patent application Ser. 
No. 09/692,554, filed Oct. 19, 2000; co-pending patent 
application Ser. No. 09/644,586, filed Aug. 23, 2000; 
co-pending patent application Ser, No. 09/643,921, filed 
Aug. 23, 2000; co-pending patent application Ser. No. 
09/653,261, filed Aug. 31, 2000; co-pending patent appli- 
cation Ser. No. 09/654,376, filed Sep. 1, 2000; co-pending 
patent application Ser. No. 09/533,022, filed Mar. 22, 2000; 
co-pending patent application Ser. No. 09/697,777, filed 
Oct. 26, 2000; co-pending patent application Ser. No. 
09/651,006, filed Aug. 29, 2000; and co-pending patent 
application Ser, No. 09/522,184, filed Mar. 9, 2000. 

FIELD OF THE INVENTION 

The present invention relates generally to telecommuni- 
cations systems, and more particularly, to a system for 
interfacing telephony devices with packet based networks. 

BACKGROUND OF THE INVENTION 

Telephony devices, such as telephones, analog fax 
machines, and data modems, have traditionally utilized 
circuit switched networks to communicate. With the current 
state of technology, it is desirable for telephony devices to 
communicate over the Internet, or other packet based net- 
works. Heretofore, an integrated system for interfacing 
various telephony devices over packet based networks has 
been difficult due to the different modulation schemes of the 
telephony devices. Accordingly, it would be advantageous to 
have an efficient and robust integrated system for the 
exchange of voice, fax data and modem data between 
telephony devices and packet based networks. 

SUMMARY OF THE INVENTION 

In one aspect of the present invention, a method of 
transmitting data includes negotiating a data rate between a 
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rate negotiator and a first telephony device, and renegotiat- 
ing the negotiated data rate between the rate negotiator and 
a system having a second telephony device to allow data 
transmission between the first and second telephony devices. 

5 In another aspect of the present invention, a method of 
establishing a data rate includes initializing a data rate, 
receiving a data rate from a first telephony device, setting a 
negotiated data rate based on the initial date rate and the data 
rate for the first telephony device, receiving a data rate from 

io a system, and setting a renegotiated data rate based on the 
negotiated data rate and the system data rate. 

In yet another aspect of the present invention, a method of 
negotiating synchronizing a data rate includes exchanging 
data rates between a first data exchange and a first telephony 

35 device, negotiating a first data rate based on the exchanged 
data rates between the first data exchange and the first 
telephony device, exchanging data rates between a second 
data exchange and a second telephony device, negotiating a 
second data rate based on the exchanged rates between the 

20 second data exchange and the second telephony device, 
exchanging the first and the second data rates over a packet 
based network, and negotiating a third data rate based on the 
exchanged first and second data rates. 
In yet a further aspect of the present invention, a data 

25 exchange includes a rate negotiator capable of negotiating a 
data rate with a first telephony device, and renegotiating the 
negotiated data rate with a system comprising a second 
telephony device to allow data transmission between the first 
and second telephony devices. 

In yet another aspect of the present invention, a signal 
transmission system includes a first telephony device having 
a data rate, a first data exchange having a data rate, a first rate 
negotiator which exchanges the data rates between the first 

35 data exchange and the first telephony device and negotiates 
a first data rate based on the exchanged data rates between 
the first data exchange and the first telephony device. A 
second telephony device having a data rate, a second data 
exchange having a data rate, and a second rate negotiator 

4(J which exchanges the data rates between the second data 
exchange and the second telephony device and negotiates a 
second data rate based on the exchanged data rates between 
the second data exchange and the second telephony device, 
wherein the first and the second rate negotiators cooperate to 

45 exchange the first and the second data rates and negotiate a 
third data rate based on the exchanged first and second data 
rates. A packet based network coupling the first data 
exchange to the second data exchange. 

In a further aspect of the present invention, a compute r- 

50 readable media embodying a program of instructions execut- 
able by a computer performs a method of transmitting data, 
the method including negotiating a data rate between a rate 
negotiator and a first telephony device, and renegotiating the 
negotiated data rate between the rate negotiator and a system 

55 having a second telephony device to allow data transmission 
between the first and second telephony devices. 

In yet a further aspect of the present invention, a 
computer-readable media embodying a program of instruc- 
tions executable by a computer performs a method of 

60 synchronizing a data rate, the method including initializing 
a data rate, receiving a data rate from a first telephony 
device, setting a negotiated data rate based on the initial date 
rate and the data rate for the first telephony device, receiving 
a data rate from a system, and setting a renegotiated data rate 

65 based on the negotiated data rate and the system data rate. 
In still yet a further aspect of the present invention, a data 
exchange includes negotiating means for negotiating a data 



07/13/2004, EAST Version: 1.4.1 



US 6,757,367 Bl 

3 4 

rate with a first telephony device, and renegotiating means of telephony devices with a packet based network in accor- 

for renegotiating the negotiated data rate with a system dance with a preferred embodiment of the present invention; 

comprising a second telephony device to allow data trans- FIG. 13 is a system block diagram of a signal processing 

mission between the first and second telephony devices. system operating in a modem relay mode in accordance with 

It is understood that other embodiments of the present 5 a preferred embodiment of the present invention; 

invention will become readily apparent to those skilled in FIG. 14 is a diagram of a relay sequence for V32bis rate 

the art from the following detailed description, wherein it is synchronization using rate re-negotiation in accordance with 

shown and described only embodiments of the invention by a preferred embodiment of the present invention; and 

way of illustration of the best modes contemplated for FIG 15 is a diagram of an alternate relay sequence for 

carrying out the invention. As will be realized, the invention io V37bis rate synchronization whereby rate signals are used 

is capable of other and different embodiments and its several to align lhe conncc tion rates at the two ends of the network 

details are capable of modification in various other respects, without rate ^negotiation in accordance with a preferred 

all without departing from the spirit and scope of the present embodiment of the present invention, 
invention. Although the the rate negotiator is described in 

the context of a data exchange, those skilled in the art will is DETAILED DESCRIPTION 

appreciate that the rate negotiator is likewise suitable for An Embodiment of a Signal Processing System 

various other telephony and telecommunications applica- In a preferred embodiment of the present invention, a 

tions. Accordingly, the drawings and detailed description are signal processing system is employed to interface telephony 

to be regarded as illustrative in nature and not as restrictive. devices with packet based networks Telephony devices 

20 include, by way of example, analog and digital phones, 

DESCRIPTION OF THE DRAWINGS ethernet phones, Internet Protocol phones, fax machines, 

These and other features, aspects, and advantages of the data modems, cable modems, interactive voice response 

present invention will become better understood with regard svstcms > PBXs > kc X s y stems > md an y othcr conventional 

to the following description, appended claims, and accom- telephony devices known in the art. The described preferred 

panying drawings where* 25 emD °diment of the signal processing system can be imple- 

HO. 1 is a block diagram of packet based infrastructure men,ed , wilh * 7™* of Zoologies including by way of 

providing a communication medium with a number of example, embedded commumcations software that enables 

telephony devices in accordance with a preferred embodi- ««™*™ <* v °><*> f« a " d ™ dc ( m over packet based 

, * ■ »■ networks. The embedded communications software is pref- 

ment of the present invention; L1 L1 . , . . r 

- . 30 erably run on programmable digital signal processors 

WO. 2 is a block diagram of a signal processing system and 

is used in gateways, cable modems, remote 

^ 1 ' me " ted Wlth a P r °S rammable digital signal processor access K pBXs and othef ^ based netWQrk 

(DSP) software architecture in accordance with a preferred appliances 

embodiment of the present invention; ^ exemplary topology is shown in FIG. 1 with a packet 

FIG. 3 is a block diagram of the software architecture 35 ne twork 10 providing a communication medium 

operating on the DSP platform of FIG. 2 in accordance with between various telephony devices. Each network gateway 

a preferred embodiment of the present invention; Ha, 26, tec includes a signal processing system which 

FIG. 4 is state machine diagram of the operational modes provides an interface between the packet based network 10 

of a virtual device driver for packet based network applica- and a number of telephony devices. In the described exem- 

tions in accordance with a preferred embodiment of the 40 plary embodiment, each network gateway 12a, 126, 12c 

present invention; supports a fax machine 14a, 146, 14c, a telephone 13a, 136, 

FIG. 5 is a block diagram of several signal processing 13 and a modem 15a, 156, 15c. Two of the network 

systems in the voice mode for interfacing a number of gateways 12a, 12b provide a direct interface between their 

telephony devices with a packet based network in accor- respective telephony devices and the packet based network 

dance with a preferred embodiment of the present invention; 45 10. The other network gateway 12c is connected to its 

FIG. 6 is a system block diagram of a signal processing respective telephony device through a public switched tele- 
system operating in a voice mode in accordance with a phone network (PSTN) 19. The network gateways 12a, 126, 
preferred embodiment of the present invention; 12c permit voice, fax and modem data to be carried over 

FIG. 7 is a block diagram of a method for obtaining voice P acke ' ba f d networks such as internet protocol (HP), frame 

parameters for future frame loss conditions in accordance 50 relay (FR), asynchronous transfer mode (AM), or any other 

with a preferred embodiment of the present invention; P acket b . ased s y stem -. . 

o • i_i i j- c iL j c . The signal processing system can be implemented with a 

FIG. 8 is a block diagram of a method for generating & , , r npn ? 3 L -. *. u • ttf^ 

r . . * ... to programmable DSP software architecture as shown in FIG. 

estimates of lost speech frames in accordance with a pre- % * . ... . , . 4U ie 4 iU 

f fi . t ■ y 2. This architecture has a DSP 17 with memory 18 at the 

ferred embodiment of the present invention; „ i_r.it. i • . _r Vn j * i 

... ,. c 1- , • 55 core > a number of network channel interfaces 19 and tele- 

FIG. 9 is a block diagram of several signal processing phony mter faces 20, and a host 21 that may reside in the DSP 

systems in the fax relay mode for interfacing a number of Uself or on a separate microcontrol i er . The network channel 

telephony devices with a packet based network in accor- interfaccs 19 provide multi-channel access to the packet 

dance with a preferred embodiment of the present invention; 5ased network . ^ telephony interfaces 23 can be con- 

FIG. 10 is a system block diagram of a signal processing 60 n ected to a circuit switched network, such as a PSTN line, 

system operating in a real time fax relay mode in accordance 0 r directly to any telephony device, 

with a preferred embodiment of the present invention; The embedded communications software binds all core 

FIG. 11 is a diagram of the message flow for a fax relay DSP algorithms together, interfaces the hardware to the host 

in non error control mode in accordance with a preferred 21, and provides low level services such as resource arbi- 

embodiment of the present invention; 65 tration and task management. An exemplary software archi- 

FTG. 12 is a block diagram of several signal processing tecture operating on a DSP platform is shown in FIG. 3. A 

systems in the modem relay mode for interfacing a number user application layer 26 provides overall executive control 
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and system management, and directly interfaces a DSP 
server 25 to the host 21 (see to FIG. 2). The DSP server 25 
provides DSP resource management and telecommunica- 
tions signal processing. The DSP server 25 communicates 
With external telephony devices (not shown) and the under- 
lying DSP 17 (see FIG. 2) via physical devices (PXD) 30a, 
306, 30c and a hardware abstraction layer (HAL) 34. 

The DSP server 25 includes a resource manager 24 which 
receives commands from, forwards events to, and exchanges 
data with the user application layer 26. The user application 
layer 26 can either be resident on the DSP 17 or alternatively 
on the host 21 (see FIG. 2), such as a microcontroller. An 
application programming interface 27 (API) provides a 
software interface between the user application layer 26 and 
the resource manager 24. The resource manager 24 manages 
the internal/external program and data memory of the DSP 15 
17. In addition the resource manager dynamically allocates 
DSP resources, performs command routing as well as other 
general purpose functions. 

The DSP server 25 also includes virtual device drivers 
(VHDs) 22a, 226, 22c. The VHDs are a collection of 20 
software algorithms that control the operation of and provide 
the facility for real time signal processing. Each VHD 22a, 
226, 22c includes an inbound and outbound media queue 
(not shown) and a library of signal processing services 
specific to that VHD 22a, 22b, 22c. In the described exem- 25 
plary embodiment, each VHD 22c, 226, 22c is a complete 
self-contained software module for processing a single chan- 
nel of voice, fax and modem. Multiple channel capability 
can be achieved by adding VHDs to the DSP server 25. The 
resource manager 24 dynamically controls the creation and 
deletion of VHDs and services. 

A switchboard 32 in the DSP server 25 dynamically 
inter-connects the PXDs 30a, 306,30c with the VHDs 22a, 
226, 22c providing multi-channel operation. Each PXD 30a, 
306, 30c is a collection of software algorithms which 
provide signal conditioning for one external telephony 
device. For example, a PXD may provide volume and gain 
control for telephony signals from its respective telephony 
device prior to communication with the switchboard 32. 
Voice, fax and modem functionalities can be supported on a 
single channel by connecting three PXDs, one for each 
telephony device, to a single VHD via the switchboard 32. 
Connections within the switchboard 32 are managed by the 
user application layer 26 via a set of API commands to the 
resource manager 24. The number of PXDs and VHDs is 
expandable, and limited only by the memory size and the 
MIPS (millions instructions per second) of the underlying 
hardware. 

A hardware abstraction layer (HAL) 34 exchanges tele- 
phony signals with the external telephony devices, and 
interfaces directly with the underlying DSP 17 hardware (see 
FIG. 2). The HAL 34 includes basic hardware interface 
routines, including DSP initialization, target hardware 
control, codec sampling, and hardware control interface 
routines. The DSP initialization routine is invoked by the 
user application layer 26 to initiate the initialization of the 
signal processing system. The DSP initialization sets up the 
internal registers of the signal processing system for 
memory organization, interrupt handling, timer 
initialization, and DSP configuration. Target hardware ini- 
tialization involves the initialization of all hardware devices 
and circuits external to the signal processing system. The 
HAL 34 is a physical firmware layer that isolates the 
communications software from the underlying hardware. 
This methodology allows the communications software to 
be ported to various hardware platforms by porting only the 
affected portions of the HAL 34 to the target hardware. 
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In operation, the user application layer 26 creates, opens, 
issues commands to, and processes events from the VHDs 
22a, 226, 22c via API commands to the resource manager 
24. In response, each VHD 22a, 226, 22c may invoke certain 
services which perform signal processing algorithms on 
telephony signals via the PXDs 30a, 306, 30c. For example, 
when a call comes in, a VHD 22a will be automatically 
opened by the resource manager 24 to handle the call. The 
VHD 22a will then communicate to the user application 
layer 26 that a call is coming in. The user application layer 
26 will respond to this information by opening a new VHD 
226, invoking the appropriate services, and commanding the 
switchboard 32 to route the incoming call between the 
appropriate PXD 306 and the VHD 226. An executive 28 
schedules the execution of the VHDs 22a, 226, 22c and their 
associated services according to assigned priorities, and 
controls the multi-tasking function of the services for each 
VHD 22a, 226, 22c. The executive 28 also communicates in 
real time the instruction cycle consumption of each VHD 
22a, 226, 22c and services to resource manager 24. The 
resource manager 24 may reallocate DSP resources as a 
result. 

The exemplary software architecture described above can 
be integrated into numerous telecommunications products. 
In a presently preferred embodiment, the software architec- 
ture is designed to support telephony signals between the 
traditional circuit switched network and the packet based 
infrastructure, A network VHD is used to support each 
channel of this operation. 

Turning to FIG. 4, an exemplary network VHD includes 
three operational modes, namely voice mode 36, fax relay 
mode 40, and modem relay mode 42. FIG, 4 shows the 
various services that are running in each operational mode. 
In the voice mode 36, call discrimination 44, packet voice 
exchange 48, and packet tone exchange 50 are running. In 
the fax relay mode 40, packet fax data exchange 52 is 
running. And in the modem relay mode 42, packet data 
modem exchange 54 is running, The network VHD controls 
each of the services including instantiation and removal. 

In the described exemplary embodiment, the network 
VHD is open and initialized to the voice mode 36 of 
operation by the user application layer 26 (see FIG. 3) via 
API commands to the resource manager 24 (see FIG. 3). The 
call discriminator 44 is responsible for differentiating 
between a voice and machine call by detecting the presence 
of a 2100 Hz. tone (as in the case when the telephony device 
is a fax or a modem), a 1100 Hz. tone or V.21 channel two 
modulated high level data link control (HDLC) flags (as in 
the case when the telephony device is a fax). If a 1100 Hz. 
tone, or V21 modulated HDLC flags are detected, a calling 
fax machine is recognized. The network VHD then termi- 
nates the voice mode 36 and invokes the packet fax data 
exchange service 52 to process the call. If however, 2100 
Hz. tone is detected, the network VHD terminates voice 
mode 36 and invokes the packet data modem exchange 
service 54. 

The packet data modem exchange service 54 further 
differentiates between a fax and modem by analyzing the 
incoming signal to determine whether V.21 modulated 
HDLC flags are present indicating that a fax connection is in 
progress. If HDLC flags are detected, the network VHD 
terminates packet data modem exchange service 54 and 
initiates packet fax data exchange service 52. Otherwise, the 
packet data modem exchange service 54 remains operative. 
In the absence of an 1100 or 2100 Hz. tone, or V.21 
modulated HDLC flags the voice mode 36 remains opera- 
tive. 
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A. The Voice Mode be applied to the sampled outgoing PCM signal 60b. In the 

Voice mode provides signal processing of voice signals. described exemplary embodiment, the AGC 74 is designed 

As shown in the exemplary embodiment depicted in FIG. 5, to adapt slowly, although it should adapt fairly quickly if 

voice mode enables the transmission of voice over a packet overflow or clipping is detected. The AGC adaptation should 

based system such as Voice over IP (VoIP, H. 323), Voice 5 be held fixed if the NLP 72 is activated, 

over Frame Relay (VoFR, FRF-11), Voice Telephony over After AGC, the digital voice samples are placed in the 

ATM (VTOA), or any other proprietary network. The voice media queue 66 in the network VHD 62 via the switchboard 

mode should also permit voice to be carried over traditional 32'. In the voice mode, the network VHD 62 invokes three 

media such as time division multiplex (TDM) networks and services, namely call discrimination, packet voice exchange, 

voice storage and playback systems. Network gateway 55a 10 and packet tone exchange. The call discriminator 68 ana- 

supports the exchange of voice between a traditional circuit lyzes the digital voice samples from the media queue to 

switched 58 and a packet based network 56. In addition, determine whether a 2100, a 1100 Hz. tone or V.21 modu- 

network gateways 556, 55c, 55d, S5e support the exchange lated HDLC flags are present. As described above with 

of voice between the packet based network 56 and a number reference to FIG. 4, if either tone or HDLC flags are 

of telephones 57a, 57b, 57c, 57 d, 51 e. Although the 35 detected, the voice mode services are terminated and the 

described exemplary embodiment is shown for telephone appropriate service for fax or modem operation is initiated, 

communications across the packet based network, it will be In the absence of a 2100, a 1100 Hz. tone, or HDLC flags, 

appreciated by those skilled in the art that other telephony the digital voice samples are coupled to the encoder system 

devices could be used in place of one or more of the which includes a voice encoder 82, a voice activity detector 

telephones. 20 (VAD) 80, a comfort noise estimator 81, a DTMF detector 

Trie PXDs for the voice mode provide echo cancellation, 76, and a packetization engine 78. 
gain, and automatic gain control. The network VHD invokes Typical telephone conversations have as much as sixty 
numerous services in the voice mode including call percent silence or inactive content. Therefore, high band- 
discrimination, packet voice exchange, and packet tone width gains can be realized if digital voice samples are 
exchange. These network VHD services operate together to 25 suppressed during these periods. A VAD 80, operating under 
provide: (1) an encoder system with DTMF detection, voice the packet voice exchange service, is used to accomplish this 
activity detection, voice compression, and comfort noise function. The VAD 80 attempts to detect digital voice 
estimation, and (2) a decoder system with delay samples that do not contain active speech. If the comfort 
compensation, voice decoding, DTMF generation, comfort noise estimator 81 can accurately regenerate parameters for 
noise generation and lost frame recovery. 30 the digital voice samples without speech, silence identifier 

The services invoked by the network VHD in the voice (SID) packets will be coupled to a packetization engine 78. 

mode and the associated PXD is shown schematically in The SID packets contain voice parameters that allow the 

FIG. 6. In the described exemplary embodiment, the PXD 60 reconstruction of the background noise at the far end. 

provides two way communication with a telephone or a From a system point of view, the VAD 80 may be 

circuit switched network, such as a PSTN line carrying a 64 35 sensitive to the change in the NLP 72. For example, when 

kb/s pulse code modulated (PCM) signal, i.e., digital voice the NLP 72 is activated, the VAD 80 may immediately 

samples. declare that voice is inactive. In that instance, the VAD 80 

Tne incoming PCM signal 60a is initially processed by may have problems tracking the true background noise 

the PXD 60 to remove far end echos. As the name implies, level. If the echo canceller 72 generates comfort noise, it 

echos in telephone systems is the return of the talker's voice 40 may have a different spectral characteristic from the true 

resulting from the operation of the hybrid with its two-four background noise. The VAD 80 may detect a change in noise 

wire conversion. If there is low end-to-end delay, echo from character when the NLP 72 is activated (or deactivated) and 

the far end is equivalent to side-tone (echo from the near- declare the comfort noise as active speech. For these 

end), and therefore, not a problem. Side-tone gives users reasons, the VAD 80 should be disabled when the NLP 72 is 

feedback as to how loud they are talking, and indeed, 45 activated. This is accomplished by a "NLP on" message 72a 

without side-tone, users tend to talk too loud. However, far passed from the NLP 72 to the VAD 80. 

end echo delays of more than about 10 to 30 msec signifi- The voice encoder 82, operating under the packet voice 

cantly degrade the voice quality and is a major annoyance to exchange service, can be a straight 16 bit PCM encoder or 

the user. any voice encoder which support one or more of the stan- 

An echo canceller 70 is used to remove echos from far end 50 dards promulgated by ITU. The encoded digital voice 
speech present on the incoming PCM signal 60a before samples are formatted into a voice packet (or packets) by the 
routing the incoming PCM signal 60a back to the far end packetization engine 78. These voice packets are formatted 
user. The echo canceller 70 samples an outgoing PCM signal according to an applications protocol and outputted to the 
60£> from the far end user, filters it, and combines it with the host (not shown). The voice encoder 82 is invoked only 
incoming PCM signal 60a. Preferably, the echo canceller 70 55 when digital voice samples with speech are detected by the 
is followed by a non-linear processor (NLP) 72 which may VAD 80. Since the packetization interval may be a multiple 
mute the digital voice samples when far end speech is of an encoding interval, both the VAD 80 and the packeti- 
detected in the absence of near end speech. The echo zation engine 78 should cooperate to decide whether or not 
canceller 70 may also inject comfort noise which may be the voice encoder 82 is invoked. For example, if the pack- 
roughly at the same level as the true background noise or at 60 etization interval is 10 msec and the encoder interval is 5 
a fixed level. msec (a frame of digital voice samples is 5 ms), then a frame 

After echo cancellation, the power level of the digital containing active speech will cause the subsequent frame to 

voice samples is normalized by an automatic gain control be placed in the 10 ms packet regardless of the VAD state 

(AGC) 74 to ensure that the conversation is of an acceptable during that subsequent frame. This interaction can be 

loudness. Alternatively, the AGC can be performed before 65 accomplished by the VAD 80 passing an "active" flag 80a to 

the echo canceller 70, however, this approach would entail the packetization engine 78, and the packetization engine 78 

a more complex design because the gain would also have to controlling whether or not the voice encoder 82 is invoked. 
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In the described exemplary embodiment, the VAD 80 is isochronous stream to the voice decoder 96. The voice 

applied after the AGC 74. This approach provides optimal decoder 96 decodes the voice frames into digital voice 

flexibility because both the VAD 80 and the voice encoder samples suitable for transmission on a circuit switched 

82 are integrated into some speech compression schemes network, such as a 64 kb/s PCM signal for a PSTN line. The 

such as those promulgated in ITU Recommendations G.729 5 output of the voice decoder 96 (or the comfort noise gen- 

with Annex B VAD (March 1996) — Coding of Speech at 8 erator 92 or lost packet recovery engine 94 if enabled) is 

kbits/s Using Conjugate-Structure Algebraic-Code-Exited written into a media queue 106 for transmission to the PXD 

Linear Prediction (CS-ACELP), and G.723. 1 with Annex A 60. 

VAD (March 1996) — Dual Rate Coder for Multimedia The comfort noise generator 92 provides background 
Communications Transmitting at 5.3 and 6,3 kbit/s, the 10 noise to the near end user during silent periods. The back- 
contents of which is hereby incorporated by reference as ground noise is reconstructed by the comfort noise generator 
through set forth in full herein. 92 from the voice parameters in the SID packets from the 

Operating under the packet tone exchange service, a voice queue 86. However, the comfort noise generator 92 

DTMF detector 76 determines whether or not there is a should not be dependent upon SID packets from the far end 

DTMF signal present at the near end. The DTMF detector 76 is for proper operation. In the absence of SID packets, the 

also provides a pre-detection flag 76 a which indicates voice parameters of the background noise at the far end can 

whether or not it is likely that the digital voice sample might be determined by running the VAD 98 at the voice decoder 

be a portion of a DTMF signal, If so, the pre-detection flag 96 in series with a comfort noise estimator 100. 

76 a is relayed to the packetization engine 78 instructing it If the protocol supports SID packets, (and these are 

to begin holding voice packets. If the DTMF detector 76 20 supported for VTOA, FRF-11, and VoIP), the comfort noise 

ultimately detects a DTMF signal, the voice packets are estimator 81 should transmit SID packets. However, for 

discarded, and the DTMF signal is coupled to the packeti- some protocols, namely, FRF-11, the SID packets are 

zation engine 78. Otherwise the voice packets are ultimately optional, and other far end users may not support SID 

released from the packetization engine 78 to the host (not packets at all. In these systems, the voice synchronizer 90 

shown). The benefit of this method is that there is only a 25 must continue to operate properly. The voice synchronizer 

temporary impact on voice packet delay when a DTMF 90 can invoke a number of mechanisms to compensate for 

signal is pre-detected in error, and not a constant buffering delay jitter in these systems if sequence numbers are not 

delay. Whether voice packets are held while the pre- embedded in the voice packet. For example, the voice 

detection flag 76a is active could be adaptively controlled by synchronizer 90 can assume that the voice queue 86 is in an 

the user application layer. 30 underflow condition due to excess jitter and perform packet 

The decoding system of the network VHD 62 essentially repeats by enabling the lost frame recovery engine 94. 

performs the inverse operation of the encoding system. The Alternatively, the VAD 98 at the voice decoder 96 can be 

decoding system of the network VHD 62 comprises a used to estimate whether or not the underflow of the voice 

depacketizing engine 84, a voice queue 86, a DTMF queue queue 86 was due to the onset of a silence period or due to 

88, a voice synchronizer 90, a DTMF synchronizer 102, a 35 packet loss. In this instance, the spectrum and/or the energy 

voice decoder 96, a VAD 98, a comfort noise estimator 100, of the digital voice signals can be estimated and the result 

a comfort noise generator 92, a lost packet recovery engine 98a fed back to the voice synchronizer 90. The voice 

94, and a tone generator 104. synchronizer 90 can then invoke the lost packet recovery 

The depacketizing engine 84 identifies the type of packets engine 94 during voice packet losses and the comfort noise 

received from the host (i.e., voice packet, DTMF packet, 40 generator 92 during silent periods. 

SID packet), transforms them into frames which is protocol When DTMF packets arrive, they are depacketized by the 

independent, transfers the voice frames (or voice parameters depacketizing engine 84. DTMF frames at the output of the 

in the case of SID packets) into the voice queue 86, and depacketizing engine 84 are written into the DTMF queue, 

transfers the DTMF frames into the DTMF queue 88. In this The DTMF synchronizer 102 couples the DTMF frames 

manner, the remaining tasks are, by and large, protocol 45 from the DTMF queue 88 to the tone generator 104, Much 

independent. like the voice synchronizer, the DTMF synchronizer 102 is 

A jitter buffer 87 is utilized to compensate for network employed to provide an isochronous stream of DTMF 

impairments such as delay jitter caused by packets not frames to the tone generator 104. Generally speaking, when 

arriving at the same time or in the same order in which they DTMF packets are being transferred, voice frames should be 

were transmitted. In addition, the jitter buffer 87 compen- 50 suppressed. To some extent, this is protocol dependent, 

sates for lost packets that occur on occasion when the However, the capability to flush the voice queue 86 to ensure 

network is heavily congested. In the described exemplary that the voice frames do not interfere with DTMF generation 

embodiment, the jitter buffer 87 includes a voice synchro- is desirable. Essentially, old voice frames which may be 

nizer 90 that operates in conjunction with a voice queue 86 queued are discarded when DTMF packets arrive. This will 

to provide an isochronous stream of voice frames to the 55 ensure that there is a significant inter-digit gap before DTMF 

voice decoder 96. tones are generated. This is achieved by a "tone present" 

Sequence numbers embedded into the voice packets at the message 88a passed between the DTMF queue and the voice 

far end can be used to detect lost packets, packets arriving synchronizer 90. 

out of order, and short silence periods. The voice synchro- The tone generator 104 converts the DTMF signals into a 
nizer 90 can analyze the sequence numbers, enabling the 60 DTMF tone suitable for a standard digital or analog tele- 
comfort noise generator 92 during short silence periods and phone. The tone generator 104 overwrites the media queue 
performing voice frame repeats via the lost packet recovery 106 to prevent leakage through the voice path and to ensure 
engine 94 when voice packets are losL SID packets can also that the DTMF tones are not too noisy, 
be used as an indicator of silent periods causing the voice There is also a possibility that DTMF tone may be fed 
synchronizer 90 to enable the comfort noise generator 92. 65 back as an echo into the DTMF detector 76. To prevent false 
Otherwise, during far end active speech, the voice synchro- detection, the DTMF detector 76 can be disabled entirely (or 
nizer 90 couples voice frames from the voice queue 86 in an disabled only for the digit being generated) during DTMF 
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tone generation. This is achieved by a "tone on" message conservative mode, a skilled listener may be able to deter- 
104a passed between the tone generator 104 and the DTMF mine that voice activity detection and comfort noise gen- 
detector 76, Alternatively, the NLP 72 can be activated while e ration is being employed. 

generating DTMF tones. In a fourth "aggressive" mode, bandwidth is at a pre- 

The outgoing PCM signal in the media queue 106 is 5 mium. The VAD is aggressive about discarding digital voice 

coupled to the PXD 60 via the switchboard 32'. The outgo- samples which are declared inactive. This approach will 

ing PCM signal is coupled to an amplifier 108 before being result in speech being occasionally clipped, but system 

outputted on the PCM output line 60k bandwidth will be vastly improved. 

1. Echo Canceller with NLP The transparent mode is typically the default mode when 
In an exemplary embodiment, the echo canceller can be 10 the system is operating with 16 bit PCM, companded PCM 

an adaptive filter which tries to model the transfer charac- (G.711) or adaptive differential PCM (ITU Recommenda- 

teristics of the hybrid and the tail circuit of the telephone tions G.726 (December 1990) — 40, 32, 24, 16 kbit/s Using 

circuit. The tail length supported should be at least 16 msec. Low- Delay Code Exited Linear Prediction, and G.727 

The adaptive filter can be a linear transversal filter or any (December 1990) — 5-, 4-, 3-, and 2-Sample Embedded 

other suitable filter. With the linear transversal filter, the 35 Adaptive Differential Pulse Code Modulation). In these 

echo canceller may be unable to cancel all of the resulting instances, the user is most likely concerned with high quality 

echo due to the non-linearities in the hybrid and tail circuit. voice since a high bit-rate voice encoder (or decoder) has 

Thus, the NLP is used to suppress the remaining echo during been selected. As such, a high quality VAD should be 

periods of far end active speech with no near end speech. employed. The transparent mode should also be used for the 

The NLP can be implemented with a suppressor that sup- 20 VAD operating in the decoder system since bandwidth is not 

presses down to the background noise level, or suppresses a concern (the VAD in the decoder system is used only to 

completely and inserts comfort noise with the spectrum update the comfort noise parameters). The conservative 

which models the true background noise. Preferably, the mode could be used with ITU Recommendation G.728 

echo canceller is compatible with one or more of the (September 1992) — Coding of Speech at 16 kbit/s Using 

following ITU Recommendations G.164 (1988) — Echo 25 Low-Delay Code Excited Linear Prediction, G.729, and 

Suppressors, G.165 (March 1993) — Echo Cancellers, and G.723,1. For systems demanding high bandwidth efficiency, 

G.168 (April 1997) — Digital Network Echo Cancellers, the the aggressive mode can be employed as the default mode, 

contents of which are incorporated herein by reference as The mechanism in which the VAD detects digital voice 

though set forth in full. samples that do not contain active speech can be imple- 

2. Automatic Gain Control 30 mented in a variety of ways. One such mechanism entails 
In an exemplary embodiment, the AGC can be either fully monitoring the energy level of the digital voice samples over 

adaptive or have a fixed gain. Preferably, the AGC supports short periods (where a period length is typically in the range 

a fully adaptive operating mode with a range of about -30 of about 10 to 30 msec). If the energy level exceeds a fixed 

dB to 30 dB. A default gain value can be independently threshold, the digital voice samples are declared active, 

established, and is typically 0 dB. If adaptive gain control is 35 otherwise they are declared inactive. The transparent mode 

used, the initial gain value is specified by this default gain. can be obtained when the threshold is set to the threshold 

3. Voice Activity Detector level of hearing. 

In an exemplary embodiment, the VAD, in either the Alternatively, the threshold level of the VAD can be 

encoder system or the decoder system, can be configured to adaptive and the background noise energy can be tracked. If 

operate in multiple modes so as to provide system tradeoffs 40 the energy in the current period is sufficiently larger than the 

between voice quality and bandwidth requirements. In a first background noise estimate by the comfort noise estimator, 

mode, the VAD is always disabled and declares all digital the digital voice samples are declared active, otherwise they 

voice samples as active speech. This mode is applicable if are declared inactive. The VAD may also freeze the comfort 

the signal processing system is used over a TDM network, noise estimator or extend the range of active periods 

a network which is not congested with traffic, or when used 45 (hangover). This type of VAD is used in GSM (European 

with PCM (ITU Recommendation G.711 (1988)— Pulse Digital Cellular Telecommunications System; Halfrate 

Code Modulation (PCM) of Voice Frequencies, the contents Speech Part 6: Voice Activity Detector (VAD) for Half Rate 

of which is incorporated herein by reference as if set forth Speech Traffic Channels (GSM 6.42), the contents of which 

in full) in a PCM bypass mode. is incorporated herein by reference as if set forth in full) and 

In a second "transparent" mode, the voice quality is 50 QCELP (W. Gardner, P. Jacobs, and C. Lee, "QCELP: A 

indistinguishable from the first mode. In transparent mode, Variable Rate Speech Coder for CDMA Digital Cellular," in 

the VAD identifies digital voice samples with an energy Speech and Audio Coding for Wireless and Network 

below the threshold of hearing as inactive speech. The Applications, B. S. atal, V. Cupennan, and A. Gersho (eds)., 

threshold may be adjustable between -90 and -40 dBm with the contents of which is incorporated herein by reference as 

a default value of -60 dBm default value. For loud back- 55 if set forth in full). 

ground noise which is rich in character such as music on In a VAD utilizing an adaptive threshold level, speech 

hold, background music, or loud background talkers (so- parameters such as the zero crossing rate, spectral tilt, 

called cocktail noise), the threshold can be adjustable energy and spectral dynamics are measured and compare 

between -90 and -20 dBm with a default value of -20 dBm. stored values for noise. If the parameters differ significantly 

The transparent mode may be used if voice quality is much 60 from the stored values, it is an indication that active speech 

more important than bandwidth. This may be the case, for is present even if the energy level of the digital voice 

example, if a G.711 voice encoder (or decoder) is used. samples is low. 

In a third "conservative" mode, the VAD identifies low When the VAD operates in the conservative or transparent 

level (but audible) digital voice samples as inactive, but will mode, measuring the energy of the digital voice samples can 

be fairly conservative about discarding the digital voice 65 be sufficient for detecting inactive speech. However, the 

samples. A low percentage of active speech will be clipped spectral dynamics of the digital voice samples may be useful 

at the expense of slightly higher transmit bandwidth. In the in discriminating between long voice segments with audio 
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spectra and long term background noise. In an exemplary 
embodiment of a VAD employing spectral analysis, the VAD 
performs auto-correlations using Itakura or Itakura-Saito 
distortion to compare long term estimates based on back- 
ground noise to short term estimates based on a period of 5 
digital voice samples. In addition, if supported by the voice 
encoder, line spectrum pairs (LSPs) can be used to compare 
long term LSP estimates based on background noise to short 
terms estimates based on a period of digital voice samples. 
Alternatively, FFT methods can be are used when the 10 
spectrum is available from another software module. 

Preferably, hangover should be applied to the end of 
active periods of the digital voice samples with active 
speech, Hangover bridges short inactive segments to ensure 
that quiet trailing, unvoiced sounds (such as /s/), are clas- 15 
sified as active. The amount of hangover can be adjusted 
according to the mode of operation of the VAD. If a period 
following a long active period is clearly inactive (i.e., very 
low energy with a spectrum similar to the measured back- 
ground noise) the length of the hangover period can be 20 
reduced. Generally, a range of about 40 to 300 msec of 
inactive speech following an active speech burst will be 
declared active speech due to hangover. 

4. Comfort Noise Generator 

A comfort noise generator plays noise. In an exemplary 25 
embodiment, a comfort noise generator in accordance with 
ITU standards G.729 Annex B or G.723.1 Annex can be 
used, These standards specify background noise levels and 
spectral content. 

Alternatively, SID packets are not used or the contents of 30 
the SID packet are unspecified (see FRF- 11) or the SID 
packets only contains an energy estimate, then estimating 
the parameters of the noise in the decoding system may be 
necessary. With this methodology, voice frames are decoded 
by the voice decoder and coupled to the VAD 98. The VAD 35 
98 does not need to be invoked when comfort noise is being 
generated. Comfort noise parameters should not be esti- 
mated or updated by the comfort noise estimator during 
frame repeats or during periods in which comfort noise is 
being is being generated by the comfort noise generator. 40 

Hie far end voice encoder should ensure that a relatively 
long hangover period is used in order to ensure that there are 
noise-only digital voice samples which the VAD decoder can 
identify as inactive speech. During the identified inactive 
periods, the digital voice samples from the voice decoder are 45 
used to update the comfort noise parameters of the comfort 
noise estimator. A mixed mode may also be employed 
whereby the energy is conveyed in a SID packet and the 
spectrum is estimated in the decoder system. Alternatively, 
if it is unknown whether or not the far end voice encoder 50 
supports (sending) SID packets, the decoder system can start 
with the assumption that SID packets are not being sent, and 
then only use the comfort noise parameters contained in the 
SID packets if and when a SID packet arrives. 

Alternatively, the comfort noise estimate could be 55 
updated with the two or three digital voice frames which 
arrived immediately prior to the SID packet. The far end 
voice encoder should then ensure that at least two or three 
frames of inactive speech are transmitted before the SID 
packet is transmitted. This can be realized by extending the 60 
hangover period. 

The comfort noise parameters at the near end are mea- 
sured by the comfort noise estimator in the encoding system 
and transferred to the far end decoder in SID packets. The 
VAD determines whether the digital voice samples in the 65 
media queue 66 contain active speech. If the VAD deter- 
mines that the digital voice samples do not contain active 



speech, then the energy and spectrum of a digital voice 
sample period is used to update a long running background 
noise energy and spectral estimate. These estimates are 
periodically quantized and transmitted in a SID packet by 
the comfort noise estimator (usually at the end of a talk spurt 
and periodically during the ensuing silent segment, or when 
the background noise parameters change appreciably). The 
comfort noise estimator should update the long running 
averages, when necessary, decide when to transmit a SID 
packet, and quantize and pass the quantized parameters to 
the packetization engine. SID packets should not be sent 
while on-hook, unless they are required to keep the perma- 
nent virtual connection between the telephony devices alive. 
There may be multiple quantization methods depending on 
the protocol chosen. 
5. Voice Encoder/Voice Decoder 

In an exemplary embodiment, the voice encoder and the 
voice decoder support one or more voice compression 
algorithms, including but not limited to, 16 bit PCM (non- 
standard, and only used for diagnostic purposes); ITU-T 
standard G.711 at 64 kb/s; G.723.1 at 5.3 kb/s (ACELP) and 
6.3 kb/s (MP-MLQ); ITU-T standard G.726 (ADPCM) at 
16, 24, 32, and 40 kb/s; ITU-T standard G.727 (Embedded 
ADPCM) at 16, 24, 32, and 40 kb/s; ITU-T standard G.728 
(LD-CELP) at 16 kb/s; and ITU-T standard G.729 Annex A 
(CS-ACELP) at 8 kb/s. 

The packetization interval for 16 bit PCM, G.711, G.726, 
G.727 and G.728 should be a multiple of 5 msec. The 
packetization interval is the time duration of the digital voice 
samples that are encapsulated into a single voice packet. The 
voice encoder (decoder) interval is the time duration in 
which the voice encoder (decoder) is enabled. The packeti- 
zation interval should be an integer multiple of the voice 
encoder (decoder) interval. By way of example, G.729 
encodes frames containing 80 digital voice samples at 8 kHz 
which is equivalent to a voice encoder (decoder) interval of 
10 msec. If two subsequent encoded frames of digital voice 
sample are collected and transmitted in a single packet, the 
packetization interval in this case would be 20 msec. 

G.711, G.726, and G.727 encodes digital voice samples 
on a sample by sample basis. Hence, the minimum voice 
encoder (decoder) interval is 0.125 msec. This is somewhat 
of a short voice encoder (decoder) interval, especially if the 
packetization interval is a multiple of 5 msec. Therefore, a 
single voice packet will contain 40 frames of digital voice 
samples. 

G.728 encodes frames containing 5 digital voice samples 
(or 0.625 msec). A packetization interval of 5 msec (40 
samples) can be supported by 8 frames of digital voice 
samples. 

G.723.1 compresses frames containing 240 digital voice 
samples. The voice encoder (decoder) interval is 30 msec, 
and the packetization interval should be a multiple of 30 
msec. 

Packetization intervals which are not multiples of the 
voice encoder (or decoder) interval can be supported by a 
change to the packetization engine or the depacketization 
engine. This may be acceptable for a voice encoder (or 
decoder) such as G.711 or 16 bit PCM, but the packetization 
interval should be a multiple of the voice encoder or decoder 
frame size. 

The G.728 standard may be desirable for some applica- 
tions. G.728 is used fairly extensively in proprietary voice 
conferencing situations and it is a good trade-off between 
bandwidth and quality at a rate of 16 kb/s. Its quality is 
superior to that of G.729 under many conditions, and it has 
a much lower rate than G.726 or G.727. However, G.728 is 
MIPS intensive. 
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Differentiation of various voice encoders (or decoders) process voice packets that arrive out of order, and to 

may come at a reduced complexity. By way of example, both dynamically switch between voice encoding methods (i.e. 

G.723.1 and G.729 could be modified to reduce complexity, between, for example, G.723.1 and G.711). Voice packets 

enhance performance, or reduce possible IPR conflicts. should be queued such that it is easy to identify the voice 

Performance may be enhanced by using the voice encoder 5 frame to be released, and easy to determine when voice 

(or decoder) as an embedded coder. For example, the "core" packets have been lost or discarded en route. 

voice encoder (or decoder) could be G.723.1 operating at 5.3 ^ voice <l ueue ma y rec l uire significant memory to queue 

kb/s with "enhancement" information added to improve the * e voice P acke j s ; B ^ wa * of exam P lc ' * G ™ * and 

voice quality. The enhancement information may be dis- th u e ™ st case ^ vanatl0n » ^° msec ' th / V0ice f ^ ueue 

_ . , „, ' „ a _ nt nr%tl . t ■ t . „. 4 l tUa , n should be capable of storing up to 500 msec of voice frames. 

carded at the source or a any point m the network, with the id m a ^ J q q{ ^ ^ £ fe P transtotes ^ ^ b tes 

quality reverting to that of the core voice encoder (or Qr m {u Wt) wQrds of s simUnly, for 16 bit PCM, 

decoder). Embedded coders can be implemented since they 500 m ^ c of ^ ffames 4K ^ Umiting ^ 

are based on a given core Embedded coders are rate amount of memory required may limit the worst case delay 

scalable, and are well suited for packet based networks. If a var i at i on 0 f 16 bit PCM and possibly G.711 This, however, 

higher quality 16 kb/s voice encoder (or decoder) is 15 depends on how the voice frames are queued, and whether 

required, one could use G.723.1 or G.729 Annex A at the dynamic memory allocation is used to allocate the memory 

core, with an extension to scale the rate up to 16 kb/s (or for the voice frames. Thus, it is preferable to optimize the 

whatever rate was desired). memory allocation of the voice queue. 

The configurable parameters for each voice encoder or The voice queue transforms the voice packets into frames 

decoder include the rate at which it operates (if applicable), 20 of digital voice samples. If the voice packets are at the 

which companding scheme to use, the packetization interval, fundamental encoding interval of the voice frames, then the 

and the core rate if the voice encoder (or decoder) is an delay jitter problem is simplified. In an exemplary 

embedded coder. For G.727, the configuration is in terms of embodiment, a double voice queue is used. The double voice 

bits/sample. For example EADPCM(5,2) (Embedded queue includes a secondary queue which time stamps and 

ADPCM, G.727) has a bit rate of 40 kb/s (5 bits/sample) 25 temporarily holds the voice packets, and a primary queue 

with the core information having a rate of 16 kb/s (2 which holds the voice packets, time stamps, and sequence 

bits/sample). numbers. The voice packets in the secondary queue are 

6. Packetization Engine disassembled before transmission to the primary queue. The 
In an exemplary embodiment, the packetization engine secondary queue stores packets in a format specific to the 

groups voice frames from the voice encoder, and with 30 particular protocol, whereas the primary queue stores the 

information from the VAD, creates voice packets in a format packets in a format which is largely independent of the 

appropriate for the packet based network. The two primary particular protocol. 

voice packet formats are generic voice packets and SID In practice, it is often the case that sequence numbers are 

packets. The format of each voice packet is a function of the included with the voice packets, but not the SID packets, or 

voice encoder used, the selected packetization interval, and 35 a sequence number on a SID packet is identical to the 

the protocol. sequence number of a previously received voice packet. 

Those skilled in the art will readily recognize that the Similarly, SID packets may or may not contain useful 

packetization engine could be implemented in the host. information. For these reasons, it may be useful to have a 

However, this may unnecessarily burden the host with separate queue may be provided for received SID packets, 

configuration and protocol details, and therefore, if a com- 40 The depacketizing engine is preferably configured to 

plete self contained signal processing system is desired, then support VoIP, VTOA, VoFR and other proprietary protocols, 

the packetization engine should be operated in the network The voice queue should be memory efficient, while provid- 

VHD. Furthermore, there is significant interaction between ing the ability to dynamically switch between voice encod- 

the voice encoder, the VAD, and the packetization engine, ers (at the far end), allow efficient reordering of voice 

which further promotes the desirability of operating the 45 packets (used for VoIP) and properly identify lost packets, 

packetization engine in the network VHD. 8. Voice Synchronization 

The packetization engine may generate the entire voice In an exemplary embodiment, the voice synchronizer 

packet or just the voice portion of the voice packet. In analyzes the contents of the voice queue and determines 

particular, a fully packetized system with all the protocol when to release voice frames to the voice decoder, when to 

headers may be implemented, or alternatively, only the voice 50 play comfort noise, when to perform frame repeats (to cope 

portion of the packet will be delivered to the host. By way with lost voice packets or to extend the depth of the voice 

of example, for VoIP, it is reasonable to create the RTP queue), and when to perform frame deletes (in order to 

encapsulated packet with the packetization engine, but have decrease the size of the voice queue). The voice synchro- 

the remaining TCP/IP stack residing in the host. In the nizer manages the asynchronous arrival of voice packets, 

described exemplary embodiment, the voice packetization 55 For those embodiments which are not memory limited, a 

functions reside in the packetization engine. The voice voice queue with sufficient fixed memory to store the largest 

packet should be formatted according to the particular possible delay variation is used to process voice packets 

standard, although not all headers or all components of the which arrive asynchronously. Such an embodiment includes 

header need to be constructed. sequence numbers to identify the relative timings of the 

7. Voice Depacketizing Engine/Voice Queue 60 voice packets. The voice synchronizer should ensure that the 
In an exemplary embodiment, voice de -packetization and voice frames from the voice queue can be reconstructed into 

queuing is a real time task which queues the voice packets high quality voice, while minimizing the end-to-end delay, 

with a time stamp indicating the arrival time. The voice These are competing objectives so the voice synchronizer 

queue should accurately identify packet arrival time within should be configured to provide system trade-off between 

one msec resolution. Resolution should preferably not be 65 voice quality and delay. 

less than the encoding interval of the far end voice encoder. Preferably, the voice synchronizer is adaptive rather than 

The depacketizing engine should have the capability to fixed based upon the worst case delay variation. This is 
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especially true in cases such as VoIP where the worst case be accomplished by passing two voice frames to the voice 

delay variation can be on the order of a few seconds. By way decoder in one decoding interval but only one of the voice 

of example, consider a VoIP system with a fixed voice frames is transferred to the media queue, 

synchronizer based on a worst case delay variation of 300 The voice synchronizer must also function under condi- 

msec. If the actual delay variation is 280 msec, the signal 5 tions of severe buffer overflow, where the physical memory 

processing system operates as expected. However, if the of the signal processing system is insufficient due to exces- 

actual delay variation is 20 msec, then the end-to-end delay sive delay variation. When subjected to severe buffer 

is at least 280 msec greater than required. In this case the overflow, the voice synchronizer could simply discard voice 

voice quality should be acceptable, but the delay would be frames. 

undesirable. On the other hand, if the delay variation is 330 10 The voice synchronizer should operate with or without 
msec then an underflow condition could exist degrading the sequence numbers, time stamps, SID packets, voice packets 
voice quality of the signal processing system. arriving out of order and lost voice packets. In addition, the 
The voice synchronizer performs four primary tasks. voice synchronizer preferably provides a variety of configu- 
First, the voice synchronizer determines when to release the ration parameters which can be specified by the host for 
first voice frame of a talk spurt from the far end. Subsequent 15 optimum performance, including minimum and maximum 
to the release of the first voice frame, the remaining voice target holding time. With these two parameters, it is possible 
frames are released in an isochronous manner. In an exem- to use a fully adaptive jitter buffer by setting the minimum 
plary embodiment, the first voice frame is held for a period target holding time to zero msec and the maximum target 
of time that is equal or less than the estimated worst case holding time to 500 msec (or the limit imposed due to 
jitter. 20 memory constraints). Although the preferred voice synchro- 
Second, the voice synchronizer estimates how long the nizer is fully adaptive and able to adapt to varying network 
first voice frame of the talk spurt should be held. If the voice conditions, those skilled in the art will appreciate that the 
synchronizer underestimates the required "target holding voice synchronizer can also be maintained at a fixed holding 
time/' jitter buffer underflow will likely result. However, time by setting the minimum and maximum holding times to 
jitter buffer underflow could also occur at the end of a talk 25 be equal. 

spurt, or during a short silence interval. Therefore, SID 9. Lost Packet Recovery/Frame Deletion 

packets and sequence numbers could be used to identify The lost packet recovery engine can be configured to 

what caused the jitter buffer underflow, and whether the provide frame insertion, and frame deletion cap ability for all 

target holding time should be increased. If the voice syn- voice decoders under consideration. For G.729 Annex A and 

chronizer overestimates the required "target holding time/' 30 G.723.1, the lost frame recovery mechanism can be part of 

all voice frames will be held too long causing jitter buffer the voice decoder. The same mechanism may be used for 

overflow. In response to jitter buffer overflow, the target frame insertion. Frame deletion can be realized by simply 

holding time should be decreased. In the described exem- passing two consecutive voice frames to the voice decoder 

plary embodiment, the voice synchronizer increases the in the same decoding interval, and discarding one of the 

target holding time rapidly for jitter buffer underflow due to 35 voice frames. In this manner, the end to end delay will be 

excessive jitter, but decreases the target holding time slowly decreased in time by one decoding interval, 

when holding times are excessive. This approach allows The frame deletion mechanism can likewise be fully 

rapid adjustments for voice quality problems while being integrated into both G.723.1 and G.729 Annex A. This 

more forgiving for excess delays of voice packets. reduces the complexity of the frame deletion mechanism and 

Thirdly, the voice synchronizer provides a methodology 40 allows voice frames to be discarded over a longer interval to 
by which frame repeats and frame deletes are performed improve the overall quality. However, since the frame dele- 
within the voice decoder. Estimated jitter is only utilized to tion is a low probability event, the short term impact on 
determine when to release the first frame of a talk spurt. voice quality should be minor. Alternatively, a non- 
Therefore, changes in the delay variation during the trans- integrated frame deletion mechanism can also be used, 
mission of a long talk spurt must be independently mom- 45 For voice decoders other than G.723.1 and G729 Annex 
tored. On buffer underflow (an indication that delay varia- A, it is desirable to have a method to handle lost voice 
tion is increasing), the voice synchronizer instructs the lost packets and to implement a frame insertion scheme, 
frame recovery engine to issue voice frames repeats. In However, the likelihood of requiring a frame insertion is 
particular, the frame repeat command instructs the lost frame typically low and the position of the frame insertion can be 
recover engine to utilize the parameters from the previous 50 selected based on decoded voice energy. This allows the 
voice frame to estimate the parameters of the current voice frame insertion mechanism to be realized through the use of 
frame. Thus, if frames 1, 2 and 3 are normally transmitted the lost frame recovery mechanism, whereby the frames 
and frame 3 arrives late, frame repeat is issued after frame from a lost voice packet are simply inserted between con- 
number 2, and if frame number 3 arrives during this period, secutive voice frames. In other words, between frame n and 
it is then transmitted. The sequence would be frames 1, 2, a 55 n+1, a frame loss is inserted. This effectively increases the 
frame repeat and then frame 3. Performing frame repeats end to end delay by one decoding interval, 
causes the delay to increase, which increasing the size of the Similarly, voice packet loss for voice telephony over ATM 
jitter buffer so as to cope with increasing delay character- and voice over FR should also be a low probability event, 
istics during long talk spurts. Frame repeats are also issued However, for voice over IP frame losses can be excessive. In 
to replace voice frames that are lost en route. 60 fact, in TCP/IP congestion can be mitigated by having 

Conversely, if the holding time is too large due to decreas- routers discard voice packets. When end points detect the 

ing delay variation, the speed at which voice frames are voice discarded packets, they typically will reduce their 

released should be increased. Typically, the target holding transmission rate. If the network begins to get congested, 

time can be adjusted, which automatically compresses the voice packet losses (which can get quite high) will occur, 

following silent interval. However, during a long talk spurt, 65 Thus, an efficient frame loss recovery mechanism is desired 

it may be necessary to decrease the holding time more to maintain reasonably high quality during voice packet 

rapidly to minimize the excessive end to end delay. This can losses. 
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Lost voice frames can be estimated by first estimating the There are numerous problems involved with the trans- 
pitch period based on digital voice samples contained in the mission of DTMF in band over a packet based network. For 
previous frames, and then repeating the previous excitation example, lossy voice decoding may distort a valid DTMF 
to an LPC filter delayed by one (or possible more) pitch tone or sequence into an invalid sequence. Also voice packet 
periods. An exemplary embodiment for estimating the pitch 5 losses of digital voice samples may corrupt DTMF 
period and excitation during previous good voice frames is seque nces and delay variation (jitter) may corrupt the DTMF 
shown in FIG. 7 Normally, when a voice frame is available timin m f orma tion and lead to lost digits. The severity of the 
from the voice decoder (or comfort noise generator 92), the various blems d dfi on the yoicc dccod 

LPC is estimated based on a frame of current plus past • I , , * 4 . . , . , , , , 

, . . 4 , t , • j i 4 l c the voice decoder rate, the voice packet loss rate, the delay 

digital voice samples (over a window length in the range of . , , . , , 4 . £ , 

about 20 to 30 msec). The digital voice samples over the 10 Variatl0n ' and he P^rUcular implementaUon of the signal 

decoding interval is then passed through a LPC inverse filter P™*™J8 - applications such as VoIP with 

110 to obtain the LPC residual. The residual (both current Potentially significant delay variation, high voice packet loss 

and past) or perhaps a combination of the residual and past rates ' and low V01CC sam P le rate ( tf G.723.1 is used), 

digital voice samples is used to obtain a pitch estimate using, P acket tone exchange is desirable. Packet tone exchange is 

for example, a pitch estimator 112 or correlation measure- 15 also desirable for VoFR (FRF-11, class 2). 

ment. In fact, a pitch estimator similar to that used in G.729 DTMF events are preferably reported to the host. This 

Annex A may be used. In this instance, pitch doubling is not allows the host, for example, to convert the DTMF sequence 

a serious problem since this lost frame recovery system is of keys to a destination address. It will, therefore, allow the 

only used in an attempt to recover a lost voice packet. host to support call routing via DTMF. 

Typically, past residuals should be stored in a buffer 114 of 20 Depending on the protocol, the packet tone exchange 

about at least 120 to 160 digital voice samples, and a pitch service might support muting of the received digital voice 

period range of between (about) 20 and 140 digital voice samples, or discarding voice frames when DTMF is 

samples should be analyzed. detected. Note that the voice packets may be queued (but not 

During a voice packet loss condition, the residual used to released) in the encoder system when DTMF is pre -detected, 

excite the LPC synthesis filter 116 is estimated by selecting 25 If the detection was false (invalid), the voice packets are 

a scaled residual from one (or more) pitch periods in the past ultimately released, otherwise they are discarded. This will 

(Z~ M ) 118. The pitch period is that which was estimated in manifest itself as occasional jitter when DTMF is falsely 

the previous good voice frame. Referring to FIG. 8, a gain detected. 

adjuster 120 slowly increases the gain to reduce the output Software to route calls via DTMF can be resident on the 

energy during multiple frame loss conditions. If the voice 30 host or within the signal processing system. Essentially, the 

packet loss condition extends for more than 40 or 50 msec, packet tone exchange traps DTMF tones and reports them to 

the resulting digital voice samples should be significantly the host or a higher layer. In an exemplary embodiment, the 

muted, and the signal processing system should switch from packet tone exchange will generate dial tone when an 

issuing frame losses to generating comfort noise. (This off-hook condition is detected. Once a DTMF digit is 

control should be placed in the voice synchronizer which 35 detected, the dial tone is terminated. The packet tone 

controls when the voice decoder, comfort noise generator, exchange may also have to play ringing tone back to the near 

and lost packet recovery engine are invoked). During a voice end user (when the far end phone is being rung), and a busy 

packet loss condition the estimated residual is saved in the tone if the far end phone is unavailable. Other tones may also 

past residual buffer 114 to ensure that for multiple frame need to be supported to indicate all circuits are busy, or an 

losses from one or more voice packets a past residual is still 40 invalid sequence of DTMF digits were entered, 

available. If a strong pitch component is not identified, B. The Fax Relay Mode 

rather than repeating past excitation delayed by the esti- Fax relay mode provides signal processing of fax signals, 
mated (best) pitch period a random (gaussian, for example) As shown in FIG. 9, fax relay mode enables the transmission 
excitation can be used to excite the LPC synthesis filter 116. of fax signals over a packet based system such as VoIP, 
The random excitation should be scaled such that the power 45 VoFR, FRF-11, VTOA, or any other proprietary network, 
is slightly less than that in the last good voice frame. The fax relay mode should also permit data signals to be 
The capability of the voice decoder should be considered carried over traditional media such as TDM. Network gate- 
when selecting the lost packet recovery engine 94, For voice ways 132a, 1326, 132c, the operating platform for the signal 
decoder's which are less MIPS intensive, such as G.726, processing system in the described exemplary embodiment, 
G.727 and G.711, the added complexity of the lost packet 50 support the exchange of fax signals between a packet based 
recovery engine would not increase the complexity to that of network 56 and various fax machines 134a, 1346, 134c. For 
say G.729 Annex A or G. 723.1. The lost frame recovery the purposes of explanation, the first fax machine is a 
engine should preferably be on the order of 1 MIP, or less. sending fax 134a. The sending fax 134a is connected to the 
For more complex voice decoders such as G.728, the sending network gateway 132a through a PSTN line 130. 
parameters used for lost voice packet recovery (LPC filter 55 The sending network gateway 132a is connected to a packet 
and pitch period) are known at the voice decoder. The lost based network 131. Additional fax machines 134fc, 134c are 
frame recovery mechanism could be integrated directly into at the other end of the packet based network 131 and include 
G.728. This is a lower complexity solution, and is preferred receiving fax machines 134b, 134c and receiving network 
for G.728. gateways 132fc>, 132c. The receiving network gateways 
10. DTMF 60 132b, 1326 provide a direct interface between their respec- 
There are two functions performed by DTMF. The first tive fax machines 134fc, 134c and the packet based network 
function performs call routing and the second function 131. 

performs DTMF relay. The transfer of fax data signals over packet based net- 

DTMF (dual-tone, multi-frequency) tones are signaling works can be accomplished by three alternative methods. In 

tones carried within the audio band. DTMF is used for 65 the first method, fax data signals are exchanged in real time, 

dialing, interactive voice response systems (IVR), and for Typically, the sending and receiving fax machines are 

PBX to PBX or PBX to central office signaling. spoofed to allow transmission delays plus jitter of up to 
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about 1.2 seconds. The second, store and forward mode, is re-modulation system, and the packet fax data exchange 

a non real time method of transferring fax data signals. service includes the ability to transmit called station tone 

Typically, the fax communication is transacted locally, (CED) and calling station tone (CNG) to support fax setup, 

stored into memory and transmitted to the destination fax The demodulation system further includes a receive fax 

machine at a subsequent time. The third mode is a combi- 5 data P um P I 56 which demodulates the fax data signals 

nation of store and forward mode with minimal spoofing to durin S the data transfer phase. The receive fax data pump 

provide an approximate emulation of a typical fax connec- 156 supports the V.27ter standard for fax data signal transfer 

£ 0Q at 2400/4800 bps, the V.29 standard for fax data signal 

In the fax relay mode, the network VHD invokes the * ran f r at . 7200/9600 ^^^^ M ^^^^^ r 

packet fax data exchange service in the fax relay mode. The id ax Jf S1 f «J ^ nsfer at 7200/9600/12000/14400 bps. The 

r , irji , & . j j j V.34 fax standard, once approved, may also be supported, 

packet fax data exchange service provides demodulation and ^ JM ^ ^ 15Q e ^ ablcs/disab y les lsQd thc 7eceive 

re-modulation of fax data signals. Tins approach resu ts in fax data > w * ^ accordance ^ me UoQ of the 

considerable bandwidth savings since only the underlying fax data signals or {ht X30 mcssages> 

unmodulated data signals are transmitted across the packet If error correction mode (ECM) is required, receive ECM 

based network. The packet fax data exchange service also is relay logic 158 performs high level data link control( HDLC 

provides compensation for network jitter with a jitter buffer )de-framing, including bit de-stuffing and preamble removal 

similar to that invoked in the packet voice exchange service. 0 n ECM frames contained in the data packets. The resulting 

Additionally, the packet fax data exchange service compen- fax data signals are then packetized by the packetization 

sates for lost data packets with error correction processing. engine 152 and communicated across the packet based 

Spoofing may also be provided during various stages of the 20 network. The T.30 relay logic 150 selectively enable/ 

procedure between the fax machines to keep the connection disables 150e the receive ECM relay logic 158 in accor- 

alive. dance with the error correction mode of operation. 

The packet fax data exchange service is divided into two In the re-modulation system, if required, incoming data 

basic functional units, a demodulation system and a packets are first translated from a T.38 packet format to a 

re-modulation system. In the demodulation system, the 25 protocol independent format by the T.38 packet shim 154. 

network VHD exchanges fax data signals from a circuit The data packets are then de-packetized by a depacketizing 

switched network, or a fax machine, to the packet based engine 162. The data packets may contain T.30 messages or 

network. In the re-modulation system, the network VHD fax data signals. The T.30 relay logic 150 reformats the 

exchanges fax data signals from the packet network to the remote T.30 indications 150 f and forwards the resulting 

switched circuit network to a circuit switched network, or a 30 T.30 indications to the local fax machine (not shown) via the 

fax machine directly. V21 data pump 148. The modulated output of the V.21 data 

During real time relay of fax data signals over a packet pump 148 is forwarded to an egress media queue 164 for 

based network, the sending and receiving fax machines are transmission in either analog format or after suitable 

spoofed to accommodate network delays plus jitter. conversion, as 64 kbps PCM samples to the local fax device 

Typically, the packet fax data exchange service can accom- 35 over a circuit switched network, such as for example a PSTN 

modate a total delay of up to about 1.2 seconds. Preferably, line. 

the packet fax data exchange service supports error correc- De-packetized fax data signals are transferred from the 

tion mode (ECM) relay functionality, although a full ECM depacketizing engine 162 to a jitter buffer 166. If error 

implementation is typically not required. In addition, the correction mode (ECM) is required, transmitting ECM relay 

packet fax data exchange service should preferably preserve 40 logic 168 performs HDLC de-framing, including bit stuffing 

the typical call duration required for a fax session over a and preamble addition on ECM frames. The transmitting 

GSTN/ISDN when exchanging fax data signals over a ECM relay logic 168 forwards the fax data signals, (in the 

network appropriate format) to a transmit fax data pump 170 which 

The packet fax data exchange service for the real time modulates the fax data signals and outputs 8 KHz digital 

exchange of fax data signals between a circuit switched 45 samples to the egress media queue 164. The T.30 relay logic 

network and a packet based network is shown schematically selectively enables/disables (150 g) the transmit ECM relay 

in FIG. 10. In this exemplary embodiment, a connecting logic 168 in accordance with the error correction mode of 

PXD (not shown) connecting the fax machine to the switch operation. 

board 32' is transparent, although those skilled in the art will The transmit fax data pump 170 supports the V27ter 

appreciate that various signal conditioning algorithms could 50 standard for fax data signal transfer at 2400/4800 bps, the 

be programmed into PXD such as echo cancellation and V29 standard for fax data signal transfer at 7200/9600 bps, 

gain. as well as the V.17 standard for fax data signal transfer at 

After the PXD (not shown), the incoming fax data signal 7200/9600A2000/14400 bps. The T.30 relay logic selec- 

146a is coupled to the demodulation system of the packet lively enables/disables (150ft) the transmit fax data pump 

fax data exchange service operating in the network VHD via 55 170 in accordance with the transmission of the fax data 

the switchboard 32'. The incoming fax data signal 146a is signals or the T.30 message samples, 

received and buffered in an ingress media queue 146. A V.21 If the jitter buffer 166 underflows, a buffer low indication 

data pump 148 demodulates incoming T.30 message so that 166a is coupled to spoofing logic 172. Upon receipt of a 

T.30 relay logic 150 can decode the received T.30 messages buffer low indication during the fax data signal transmission, 

150a. Local T.30 indications 1506 are packetized by a 60 the spoofing logic 172 inserts "spoofed data" at the appro- 

packetization engine 152 and if required, translated into T.38 priate place in the fax data signals via the transmit fax data 

packets via a T.38 shim 154 for transmission to a remote fax pump 170 until the jitter buffer 166 is rilled to a pre- 

device (not shown) across the packet based network. The determined level, at which time the fax data signals are 

V.21 data pump 148 is selectively enabled/disabled 150c by transferred out of the jitter buffer 166. Similarly, during the 
the T.30 relay logic 150 in accordance with the reception/ 65 transmission of the T.30 message indications, the spoofing 

transmission of the T.30 messages or fax data signals. The logic 172 can insert "spoofed data" at the appropriate place 

V.21 data pump 148 is common to the demodulation and in the T.30 message samples via the V.21 data pump 148. 
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1. Data Rate Management 

An exemplary embodiment of the packet fax data 
exchange service complies with the T.38 recommendations 
for real-time Group 3 facsimile communication over IP 
networks. In accordance with the T3S standard, the pre- 
ferred system should therefore, provide packet fax data 
exchange service support at both the T.30 level (see ITU 
Recommendation T.30 — " Procedures for Document Fac- 
simile Transmission in the General Switched Telephone 
Network", 1988) and the T4 level (see ITU Recommenda- 
tion T.4 — "Standardization of Group 3 Facsimile Apparatus 
For Document Transmission", 1998), the contents of each of 
these ITU recommendations being incorporated herein by 
reference as if set forth in full. One function of the packet fax 
data exchange service is to relay the set up (capabilities) 
parameters in a timely fashion. Spoofing may be needed at 
either or both the T.30 and T.4 levels to maintain the fax 
session while set up parameters are negotiated at each of the 
network gateways and relayed in the presence of network 
delays and jitters. 

In accordance with the industry T.38 recommendations 
for real time Group 3 communication over Internet Protocol 
(IP) networks, the described exemplary embodiment relays 
all information including; T.30 preamble indications (flags), 
T.30 message data, as well as T.30 image data between the 
network gateways. The T.30 relay logic 150 in the sending 
and receiving network gateways then negotiate parameters 
as if connected via a PSTN line. The T.30 relay logic 150 
interfaces with the V.21 data pump 148 and the transmit and 
receive data pumps 156 and 170 as well as the packetization 
engine 152 and the depacketizing engine 162 to ensure that 
the sending and the receiving fax machines 130 and 134 
successfully and reliably communicate. The T.30 relay logic 
150 provides local spoofing, using command repeats (CRP), 
and internal automatic repeat request (ARQ) mechanisms to 
handle delays associated with the packet based network. In 
addition, the T.30 relay logic 150 intercepts control mes- 
sages to ensure compatibility of the rate negotiation between 
the near end and far end machines including HDLC 
processing, as well as lost packet recovery according to the 
T.30 ECM standard. 

FIG. 11 demonstrates message flow over a packet based 
network between a sending fax machine 134a (see FIG. 9) 
and the receiving fax device 134£> (see FIG. 9) in non-ECM 
mode. The sending fax machine dials the sending network 
gateway 132a (see FIG. 9) which forwards CNG (not 
shown) to the receiving network gateway 132b (see FIG. 9). 
The receiving network gateway responds by alerting the 
receiving fax machine. The receiving fax machine answers 
the call and sends CED 230 tones. The CED tones are 
detected by the V.21 data pump 148 of the receiving network 
gateway which issues an event 232 indicating the receipt of 
CED which is then relayed to the emitting network gateway. 
In addition, the V21 data pump of the receiving network 
gateway invokes the packet fax data exchange service. 

The receiving network gateway now transmits T.30 pre- 
amble (HDLC flags) 234 followed by called subscriber 
identification (CSI) 236 and digital identification signals 
(DSI) 238. The emitting network gateway, receives a com- 
mand 240 to begin transmitting CED. Upon receipt of CSI 
and DSI, the emitting network gateway begins sending 
subscriber identification (TSI) 242, digital command signal 
(DCS) 244 followed by training check (TCF) 246. The TCF 
246 can be managed by one of two methods. The first 
method, referred to as the data rate management method one 
in T.38 , generates TCF locally by the receiving gateway. 
CFR is returned to the sending fax machine 250, when the 
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emitting network gateway receives a confirmation to receive 
(CFR) 248 from the receiving fax machine via the receiving 
network gateway, and the TCF training 246 from the sending 
fax machine is received successfully. In the event that the 

5 receiving fax machine receives a CFR and the TCF training 
246 from the sending fax machine subsequently fails, then 
DCS 244 from the sending fax machine is again relayed to 
the receiving fax machine. The TCF training 246 is repeated 
until an appropriate rate is established which provides 

10 successful TCF training 246 at both ends of the network. 
In a second method to synchronize the data rate, referred 
to as the data rate management method 2 in the T.38 
standard, the TCF data sequence received by the emitting 
network gateway are forwarded from the sending fax 

15 machine to the receiving fax machine via the receiving 
network gateway. The sending and receiving fax machines 
and then perform speed selection as if connected via a 
regular PSTN. 

Upon receipt of confirmation to receive (CFR) 250, the 

20 sending fax machine, transmits image data 254 along with 
its training preamble 252. The emitting network gateway 
receives the image data and forwards the image data 254 to 
the receiving network gateway. The receiving network gate- 
way then sends its own training preamble 256 followed by 

25 the image data 258 to the receiving fax machine. 

After each image page end of page (EOP), an EOP 260 
and message confirmation (MCF) 262 messages are relayed 
between the sending and receiving fax machines. At the end 
of the final page, the receiving fax machine sends a message 

30 confirmation (MCF) 262, which prompts the sending fax 
machine to transmit a disconnect (DCN) signal 264. The call 
is then terminated at both ends of the network. 

ECM fax relay message flow is similar to that described 
above. All preambles, messages and phase C HDLC data are 

35 relayed through the packet based network. Phase C HDLC 
data is de -stuffed and, along with the preamble and frame 
checking sequences (FCS), removed before being relayed so 
that only fax image data itself is relayed over the packet 
based network. The receiving network gateway performs bit 

40 stuffing and reinserts the preamble and FCS. 
2. Spoofing Techniques 

In the described exemplary embodiment, spoofing tech- 
niques are utilized at the T.30 and T.4 levels to manage 
extended network delays and jitter. Turning back to FIG. 10, 

45 the spoofing logic 172 includes built in timeouts for auto- 
matic requests for retransmission (ARQ). Automatic tim- 
eouts ensure that the connection is maintained in a system 
impaired by delay. T.30 spoofing is used to reset the T4 
timer, defined in accordance with the ITU T.30 

50 recommendations, to prevent a command or response 
retransmission. The T.30 relay logic 150 waits for a response 
to any transmitted message or command before continuing 
to the next state or phase. The T.30 relay logic 150 packages 
each message or command into a HDLC frame which 

55 includes preamble flags. 

The sending and receiving network gateways 134a, 1346 
(See FIG. 9) spoof their respective fax machines 134a, 134b 
by locally transmitting preamble flags if a response from the 
packet based network is not received prior to T4 time out 

60 (3 ±0.15 sec). Preferably, the waiting period is less than 
about 2.7 sec, which has been empirically demonstrated to 
eliminate activation of the T4 timer for most fax machines. 
In addition, the maximum length of the preamble is limited 
to about 4.5 seconds. If a response from the packet based 

65 network arrives before the spoofing time out, each network 
gateway should preferably transmit a response message to 
its respective fax machine following the preamble flags. 
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Each network gateway repeats the spoofing technique until 
a successful handshake is completed or its respective fax 
machine disconnects, 

T.4 spoofing handles delay impairments during phase C 
signal reception. The composition of the phase C signal 5 
depends on whether ECM is being used, so that an appro- 
priate spoofing method must be implemented for each mode. 
For those systems that do not utilize ECM, phase C signals 
consist of a series of coded image data followed by fill bits 
and end-of-line (EOL) sequences. Typically, fill bits are 10 
zeros inserted between the fax data signals and the EOL 
sequences. Fill bits ensure that a fax machine has time to 
perform the various mechanical overhead functions associ- 
ated with any line it receives. Fill bits can also be utilized to 
spoof the jitter buffer in accordance with a spoofing method 1S 
known as EOL spoofing. The number of the bits of coded 
image contained in the data signals associated with the scan 
line and transmission speed limit the number of fill bits that 
can be added to the data signals. Preferably, the maximum 
transmission of any coded scan line is limited to less than 2 o 
about 5 sec. Thus, if the coded image for a given scan line 
contains 1000 bits and the transmission rate is 2400 bps, 
then the maximum duration of fill time is (5 -(1000+ 12)/ 
2400>4.57 sec. 

Generally, the packet fax data exchange service utilizes 2 s 
spoofing if the network jitter delay exceeds the delay capa- 
bility of the jitter buffer 166. In accordance with the EOL 
spoofing method, fill bits can only be inserted immediately 
before an EOL sequence, so that by necessity, the jitter 
buffer 166 must store at least one EOL sequence. Thus the 30 
jitter buffer 166 must be sized to hold at least one entire scan 
line of data to ensure the presence of at least one EOL 
sequence within the jitter buffer 166. Thus, depending upon 
transmission rate, the size of the jitter buffer 166 can become 
prohibitively large. The table below summarizes the 35 
required jitter buffer data space to perform EOL spoofing for 
various scan line lengths. The table assumes that each pixel 
is represented by a single bit. The values represent an 
approximate upper limit on the required data space, but not 
the absolute upper limit, because in theory at least, the 40 
longest scan line can consist of alternating black and white 
pixels which would require an average of 4.5 bits to repre- 
sent each pixel rather than the one to one ratio summarized 
in the table. 



Scan 


Number 


sec to 


sec to 


sec to 


sec to 


line 


of 


print out 


print out 


print out 


print out 


Length 


words 


at 2400 


at 4800 


at 9600 


at 14400 


1728 


108 


0.72 


0.36 


0.18 


0.12 


2048 


128 


0.853 


0.427 


0.213 


0.14 


2432 


152 


1.01 


0.507 


0.253 


0.17 


3456 


216 


1.44 


0.72 


0.36 


0.24 


4096 


256 


2 


0.853 


0.43 


0.28 


4864 


304 


2.375 


1.013 


0.51 


0.34 



50 



55 



To ensure the jitter buffer 166 stores an EOL sequence the 
spoofing logic 172 is activated when the number of data 
packets stored in the jitter buffer 166 drops to a threshold 
level. Typically, a threshold value of about 200 msec is used 60 
to support the most commonly used fax setting, namely a fax 
speed of 9600 bps and scan line length of 1728. An alternate 
spoofing method should be used if an EOL sequence is not 
contained within the jitter buffer 166, otherwise the call will 
have to be terminated. An alternate spoofing method uses 65 
zero run length code words. This method requires real time 
image data decoding so that the word boundary is known. 
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Advantageously, this alternate method reduces the required 
size of the jitter buffer 166. 

In error correction mode, phase C signals consist of 
HDLC frames so that HDLC spoofing can be used. The jitter 
buffer 166 must be sized to store at least one HDLC frame 
so that a frame boundary may be located. The length of the 
largest T.4 ECM HDLC frame is 260 octets or 130 16-bit 
words. Again, spoofing is activated when the number of 
packets stored in the jitter buffer 166 drops to a predeter- 
mined threshold level. When spoofing is required, the spoof- 
ing logic 172 adds HDLC flags at the frame boundary as a 
complete frame is being reassembled and forwarded to the 
transmit fax data pump 170. This continues until the number 
of data packets in the jitter buffer 166 exceeds the threshold 
level. 

Simply increasing the storage capacity of the jitter buffer 
166 can minimize the need for spoofing. However, overall 
network delay increases when the size of jitter buffer 166 is 
increased. This delay may complicate the T.30 negotiation at 
the end of page or end of document, because of suscepti- 
bility to time out. Such a situation arises when the sending 
fax machine completes the transmission of high speed data, 
and switches to an HDLC phase and sends the first V.21 
packet in phase D. The sending fax machine must be kept 
alive until the response to the V.21 data packet is received. 
The receiving fax device requires more time to flush a large 
jitter buffer 166 and then respond, hence complicating the 
T.30 negotiation. 

In addition, the length of time a fax machine can be 
spoofed is limited, so that the jitter buffer 166 can not be 
arbitrarily large. A pipelined store and forward relay is a 
combination of store and forward and spoofing techniques to 
approximate the performance of a typical Group 3 fax 
connection when the network delay is large (on the order of 
seconds or more). One approach is to store and forward a 
single page at a time. However, this approach requires a 
significant amount of memory (10 K words or more). One 
approach to reduce the amount of memory required entails 
discarding scan lines on the sending network gateway and 
performing line repetition on the receiving network gateway 
so as to maintain image aspect ratio and quality. 
Alternatively, a partial page can be stored and forwarded 
thereby reducing the required amount of memory. 

The sending and receiving fax machines will have some 
minimal differences in clock frequency. ITU standards rec- 
ommends a data pump data rate of ±100 ppm, so that the 
clock frequencies between the receiving and sending fax 
machines could differ by up to 200 ppm. Therefore, the data 
rate at the receiving network gateway (jitter buffer 166) can 
build up or deplete at a rate of 1 word for every 5000 words 
received. Typically a fax page is less than 1000 words so that 
end to end clock synchronization is not a problem. 

C. Data Relay Mode 

Data relay mode provides signal processing of data sig- 
nals. As shown in FIG. 12, data relay mode enables the 
transmission of data signals over a packet based system such 
as VoIP, VoFR, FRF-11, VTOA, or any other proprietary 
network. The data relay mode should also permit data 
signals to be carried over traditional media such as TDM. 
Network gateways 1820,1826, 182c, the operating platform 
for the signal processing system in the described exemplary 
embodiment, support the exchange of data signals between 
a packet based network 181 and various data modems 180a, 
lS0b f 180c. For the purposes of explanation, the first modem 
is a calling modem 180a. The calling modem 180a is 
connected to the calling network gateway 182a through a 
PSTN line. The calling network gateway 182a is connected 
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to a packet based network 181. Additional modems 180b, the packet based network via a depacketizing engine 206. 

180c are at the other end of the packet based network 181 The call negotiator 200 relays ANSam answer tones and 

and include answer modems 180b, 180c and answer network other indications 200d to a local modem (not shown) via an 

gateways 1826, 182c. The answer network gateways 182b, egress media queue 212 of the modulation system. The call 

182c provide a direct interface between their respective 5 negotiator 200 relays ANSam answer tones and other indi- 

modem 180b, 180c and the packet based network 181. cations 200e t0 lhe remote modem via a packetization 

In data relay mode, a local modem connection is estab- engine 204. 

lished on each end of the packet based network 181. That is, t A data pump receiver 202 demodulates the data signals 

the calling modem 180a and the calling network gateway %om the ingress media queue 198. The data pump receiver 

182a establish a local modem connection, as does the 10 f f J^^bis sta ndard for the demodulation of 

j , ■ > • , 10A , j ' . data signals at 1200/2400 bps; the V.32bis standard for the 

destination answer . modem 180b and its respective answer demodulation of data si H al ; at 4800/7200/9600/12000/ 

network gateway 182b. Next data signals are relayed across 144QQ ^ weU ^ ^ ^ for ^ demodulation 

foe packet based network 181. The calling network gateway of data s{gmh up to 33m bps Moreovcr7 me V 90 standard 

182a demodulates the modem data signal and generates a may also be SU p port ed. The demodulated data signals are 

formatted signal appropriate for the packet based network is men pac ketized by the packetization engine 204 and trans- 

181. The answer network gateway 182b compensates for milled across the packet based network, 

network impairments and re-modulates the encoded data in i n the re-modulation system, packets of data signals from 

a format suitable for the destination answer modem 180b. the packet based network are first de-packetized by the 

This approach results in considerable bandwidth savings depacketizing engine 206 and stored in a jitter buffer 208. A 

since only the underlying unmodulated data signals are 20 data pump transmitter 210 modulates the buffered data 

transmitted across the packet based network. signals with a voiceband carrier. The modulated samples are 

In the data relay mode, the packet data modem exchange in turn stored in the egress media queue 212 before being 

service provides demodulation and modulation of data sig- output to the PXD (not shown) via the switchboard 32'. The 

nals. The packet data modem exchange also provides com- data pump transmitter 210 supports the V.22bis standard for 

pensation for network jitter with a jitter buffer similar to that 25 tne transfer of data signals at 1200/2400 bps; the V32bis 

invoked in the packet voice exchange service. Additionally, standard for the transfer of data signals at 4800/7200/9600/ 

the packet data modem exchange service compensates for 12000/14400 bps, as well as the V. 34 standard for the 

system clock jitter between the near end and far end modems transfc ' of daU * lg ? al Up t0 33 *°° bpS " Morcover > the Y9 ° 

with a dynamic phase adjustment and resampling mecha- * b ° ff( f r ^ow, the jitter buffer 208 sends 

nism. Spoofing may also be provided during various stages 30 a buffef ^ indicatioQ 2ft8fl tQ fiQ J ^ m whea ^ 

of the call negotiation procedure between the modems to spoofing logic 2U recdves the buffer bw signal indicating 

keep the connection alive. mat tne jiUer 5uffer 208 ^ opera ting below a pre-determined 

The packet data modem exchange service invoked by the threshold level, it inserts spoofed data at the appropriate 

network VHD in the data relay mode is shown schematically p i ace j n tne data signal via the data pump transmitter 210. 

in FIG. 13. In the described exemplary embodiment, a 35 Spoofing continues until the jitter buffer 208 is filled to the 

connecting PXD (not shown) connecting the modem to the pre-determined threshold level, at which time data signals 

switch board 32' is transparent, although those skilled in the are again transferred from the jitter buffer 208 to the data 

art will appreciate that various signal conditioning algo- pump transmitter 210. 

rithms could be programmed into PXD such as filtering, An end to end clock synchronizer 216 also monitors the 

echo cancellation and gain. 40 state of the jitter buffer 208. The clock synchronizer 216 

After the PXD, the data signals are coupled to the network controls the data transmission rate of the data pump trans- 

VHD via the switchboard 32'. The packet data modem mitter 210 in correspondence to the state of the jitter buffer 

exchange provides two way communication between a cir- 208. When the jitter buffer 208 is below a pre-determined 

cuit switched network and packet based network with two threshold level, the clock synchronizer 216 reduces the 

basic functional units, a demodulation system and a 45 transmission rate of the data pump transmitter 210. 

re-modulation system. In the demodulation system, the Likewise, when the jitter buffer 208 is above a pre- 

network VHD exchanges data signals from a circuit determined threshold level, the clock synchronizer 216 

switched network, or a telephony device directly, to a packet increases the transmission rate of the data pump transmitter 

based network. In the re-modulation system, the network 210. 

VHD exchanges data signals from the packet based network 50 A rate negotiator 218 synchronizes the connection rates at 

to the PSTN line, or the telephony device. the network gateways 182a, 182b, 182c (see FIG. 12). The 

In the demodulation system, the data signals are received rate negotiator receives rate control codes 218a from the 

and buffered in an ingress media queue 198. A call nego- local modem via the data pump state machine 220 and rate 

tiator 200 determines the type of modem connected locally control codes 218b from the remote modem via the depack- 

via a circuit switched network, such as a PSTN line carrying 55 etizing engine 206. The rate negotiator 218 forwards the 

data signals modulated by a voiceband carrier (e.g., 8 KHz.), remote rate control codes 218a received from the remote 

as well as the type of modem connected remotely via a modem to the local modem via commands sent to the data 

packet based network. The call negotiator 200 utilizes V.25 pump state machine 220. The rate negotiator 218 forwards 

automatic answering procedures and V.8 auto -baud software the local rate control codes 218c received from the local 

to automatically detect modem capability. The call negotia- 60 modem to the remote modem via the packetization engine 

tor 200 receives the data signals 200 a (ANSam and V.8 204. Based on the exchanged rate codes the rate negotiator 

menus) from the ingress media queue 198, as well as AA, 218 establishes a common data rate between the calling and 

AC and other message indications 220b from the local answering modems. During the data rate exchange 

modem via a data pump state machine 220, to determine the procedure, the jitter buffer 208 should be disabled by the rate 

type of modem in use locally. The call negotiator also 65 negotiator 218 to prevent data transmission between the call 

receives ANSam, AA, AC and other indications from a and answer modems until the data rates are successfully 

remote modem (not shown) located on the opposite end of negotiated. 
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An error control synchronizer 222 performs a similar In the described exemplary embodiment, end to end clock 

function by ensuring that the network gateways utilize a synchronizer 216 monitors the space available within the 

common error protocol. The error control synchronizer 222 jitter buffer 208 and utilizes water marks to determine 

processes local error control messages 222a from the data whether the data rate of the data pump transmitter 210 

pump receiver 202 in addition to remote V14/V42 indica- 5 should be adjusted. Network jitter may cause timing adjust- 

tioos 222b from the depacketizing engine 206. The error ments t0 b e made. However, this should not adversely affect 

control synchronizer 222 forwards V.14/V42 negotiation tne data pum receiver of the answering modem as these 

messages 222c to the local modem via the data pump timin adjustments are madc vcry gradua n y . 

Tw^a^o J & y° T SS^i s > n ? hr( ? mz f 22 } for " 2. Rate Synchronization 

wards V14/V.42 indications 222c/ from the local modem to Rate hronization refers t0 the ss b which two 

the remote modem via the packetization engine 204. A . , ' , . 4 , , * , \ 

The packet data modem exchange service preferably tele / hon y dev ! ces connected at the same data rate prior 

utilizes indication packets as a means for communicating to data ^nsmissioo. In the context of a modem ^connection 

answer tones, AA, AC and other indication signals across the m accordance with an exemplary embodiment of the present 

packet based network 10. However, the packet data modem invention, each modem is coupled to a signal processing 

exchange service supports data pumps such as V.22bis and 15 system, which for the purposes of explanation is operating 

V 32bis which do not include a well defined error recovery in a network gateway, either directly or through a PSTN line, 

mechanism, so that the modem connection may be termi- In operation, each modem establishes a modem connection 

nated whenever indication packets are lost. Therefore, either with its respective network gateway, at which point, the 

the packet data modem exchange or upper application layer modems begin relaying data signals across a packet based 

should ensure proper delivery of indication packets when 20 network. The problem that arises is that each modem may 

operating in a network environment that does not guarantee negotiate a different data rate with its respective network 

packet delivery. gateway, depending on the line conditions and user settings. 

The packet data modem exchange service can ensure In this instance, the data signals transmitted from one of the 

delivery of the indication packets by periodically modems will enter the packet based network faster than it 

re-transmitting the indication packet until some expected 25 can be extracted at the other end by the other modem. The 

packets are received. For example, in V.32bis relay the call resulting overflow of data signals may result in a lost 

negotiator operating under the packet data modem exchange connection between the two modems. To prevent data signal 

on the answer network gateway periodically re-transmits overflow, it is, therefore, desirable to ensure that both 

ANSam answer tones from the answer modem to the calling modems negotiate to the same data rate. A rate negotiator 

modem, until the calling modem connects to the line and 30 can be used for this purpose. Although the the rate negotiator 

transmits carrier state AA. is described in the context of a signal processing system with 

Alternatively, the packetization engine can embed the the packet data modem exchange service invoked, those 
indication information directly into the packet header. In this skilled in the art will appreciate that the rate negotiator is 
approach the indication information is included in all pack- likewise suitable for various other telephony and telecom- 
ets transmitted across the packet based network, so that the 35 munications application. Accordingly, the described exem- 
system does not rely on the successful transmission of plary embodiment of the rate negotiator in a signal process- 
individual indication packets. Rather, if a given packet is ing system is by way of example only and not by way of 
lost, the next arriving packet contains the indication infor- limitation. 

mation in the packet header. Both methods increase the In an exemplary embodiment, data rate synchronization is 
traffic across the network. However, it is preferable to 40 achieved through a data rate negotiation procedure, wherein 
periodically re-transmit the indication packets because it has a calling modem independently negotiates a data rate with a 
less of a detrimental impact on network traffic. calling network gateway, and a answer modem indepen- 
1. End to End Clock Synchronization dently negotiates a data rate with a answer data relay. The 
Slight differences in the clock frequency of the calling calling and answer network gateways, each having a signal 
modem and the answer modem are expected, since the baud 45 processing system running a packet exchange service, then 
rate tolerance for a typical modem data pump is ±100 ppm. exchange data packets containing information on the inde- 
This tolerance corresponds to a relatively low depletion or pendently negotiated data rates. If the independently nego- 
build up rate of 1 in 5000 words. However, the length of a tiated data rates are the same, then each rate negotiator will 
modem session can be very long, so that uncorrected dif- enable its respective network gateway and data transmission 
ference in clock frequency can result in jitter buffer under- 50 between the call and answer modems will commence, 
flow or overflow. Conversely, if the independently negotiated data rates are 
In an exemplary embodiment, the packet data modem different, the rate negotiator will renegotiate the data rate by 
exchange synchronizes the transmit clock of each network adopting the lowest of the two data rates. The call and 
gateway to the average rate at which data packets arrive at answer modems will then undergo retraining or rate 
their respective jitter buffer. The data pump transmitter 210 55 re-negotiation procedures by their respective network gate- 
examines the egress media queue 212 at the beginning of ways to establish a new connection at the renegotiated data 
each frame. In accordance with the remaining buffer space, rate. The advantage of this approach is that the data rate 
data pump transmitter 210 modulates that number of digital negotiation procedure takes advantage of existing modem 
data samples required to produce a total of slightly more or functionality, namely, the retraining mechanism, and puts it 
slightly less than 80 samples per frame, assuming that the 60 to alternative usage. Moreover, by retraining both the call 
data pump transmitter 210 is invoked once every 10 msec. and answer modem (one modem will already be set to the 
The data pump transmitter 210 gradually adjusts the number renegotiated rate) the modem connection should not be lost 
of samples per frame to allow the receiving modem to adjust due to timeout. 

to the timing change. Typically, the data pump transmitter In an alternate method for rate synchronization, the call- 

210 uses an adjustment rate of about one ppm. In addition, 65 ing and answer modems can directly negotiate the data rate, 

the maximum adjustment rate should be less than about 200 This method is not preferred for modems with time con- 

ppm. strained handshaking sequences such as, for example, 
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modems operating in accordance with the V.22bis or the signal (SI) indicates that 2400 bps operation is to be used. 
V.32bis standards. The round trip delay accommodated by In addition, the absence of such a tone indicates that 1200 
these standards could cause the modem connection to be lost bps should be selected. The duration of the signal is typically 
due to timeout. Instead, retrain or rate renegotiation should about 100 msec, making it likely that the calling modem will 
be used for data signals transferred in accordance with the 5 perform rate determination (assuming that it selects 2400 
V22bis and V.32bis standards, whereas direct negotiation of bps) before rate indication from the answer modem arrives, 
the data rate by the local and remote modems can be used for Therefore, the rate negotiator within the packet data modem 
data exchange in accordance with the V.34 and V.90 (a exchange operating in the calling network gateway should 
digital modem and analog modem pair for use on PSTN select 2400 bps operation and proceed with the handshaking 
lines at data rates up to 56,000 bps downstream and 33,600 10 procedure. If the answer modem is limited to a 1200 bps 
upstream) standards. connection, rate re-negotiation is typically used to change 

A single industry standard for the transmission of modem the operational data rate of the calling modem to 1200 bps. 
data over a packet based network does not exists. However, In this case, if the calling modem selects 1200 bps, rate 
numerous common standards exists for transmission of re-negotiation would not be required, 
modem data at various data rates over the public switched is b. V.32bis Rate Synchronization 

telephone network. For example, V.22 is a common standard V34bis handshaking utilizes rate signals (messages) to 
used to define operation of 1200 bps modems. Data rates as specify the bit rate. A typical relay sequence in accordance 
high as 2400 bps can be implemented with the V.22bis with the V.32bis standard is shown in FIG. 14 and begins 
standard (the suffix "bis" indicates that the standard is an with the call negotiator operating under the packet data 
adaptation of an existing standard). The V.22bis standard 20 modem exchange in the answer network gateway relaying 
groups data into four bit words which are transmitted at 600 ANSam 270 answer tone from the answer modem to the 
baud. The V.32 standard supports full duplex, data rates of calling modem. After receiving the answer tone for a period 
up to 9600 bps over the general switched telephone network. of at least one second, the calling modem connects to the line 
A V.32 modem groups data into four bit words and transmits and repetitively transmits carrier state A 272. When the 
at 2400 baud. The V32bis standard supports duplex modems 25 calling network gateway detects AA, the calling network 
operating at data rates up to 14,400 bps on the general gateway relays this information to the answer network 
switched telephone network. In addition, the V.34 standard gateway. The packet data modem exchange operating on the 
supports data rates up to 33,600 bps on the general switched answer network gateway invokes operation in accordance 
telephone network. with the V.32bis standard upon receipt of AAindication. The 

V.42 is a standard error correction technique using 30 answer modem then transmits alternating carrier states A and 
advanced cyclical redundancy checks and the principle of C. If answer network gateway receives AC from the answer 
automatic repeat requests (ARQ). In accordance with the modem, the answer network gateway relays it to the calling 
V.42 standard, transmitted data is grouped into blocks and network gateway, thereby establishing operation in accor- 
cyclical redundancy calculations add error checking words dance with the V.32bis standard, allowing call negotiator 
to the transmitted data stream. The receiving modem calcu- 35 operating under the packet data modem exchange in the 
lates new error check information for the data block and calling network gateway to be terminated. Next, data rate 
compares the calculated information to the received error alignment is achieved by either of two methods, 
check information. If the codes match, the received data is In the first method for data rate alignment of a V32bis 
valid and another transfer takes place. If the codes do not relay connection, the calling modem and the answer modem 
match, an transmission error has occurred and the receiving 40 independently negotiate a data rate at each end of the 
modem requests a repeat of the last data block. This repeat network 280 and 282. Each network gateway forwards a 
cycle continues until valid data has been received. connection data rate indication 284 and 286 to the other 

Various voiceband data modem standards exist for error network gateway. Each network gateway compares the far 
correction and data compression. V.42bis and MNP5 are end data rate to its own data rate. The preferred rate is the 
examples of data compression standards. The handshaking 45 minimum of the two rates. Rate re-negotiation 288 and 290 
sequence for every modem standard is different so that the is invoked if the connection rate of either network gateway 
packet data modem exchange service should support numer- differs from the preferred rate. 

ous data transmission standards as well as numerous error In the second method, rate signals Rl, R2 and R3, are 
correction and data compression techniques. relayed to achieve data rate synchronization. FIG. 15 shows 

a, V.22 Rate Synchronization 50 a relay sequence in accordance with the V.32bis standard for 

The call negotiator, operating under the packet data this alternate method of rate synchronization. The call 
modem exchange on the answer network gateway, differen- negotiator relays the answer tone (ANSam) 292 from the 
tiates between modem types and relays the ANSam answer answer modem to the calling modem. When the calling 
tone. The answer modem transmits unscrambled binary ones modem detects answer tone it repetitively transmits carrier 
signal (USB1) indications to the answer mode gateway. The 55 state A 294, the calling network gateway relays this infor- 
answer network gateway forwards USB1 signal indications mation (AA) 296 to the answer network gateway. The 
to the calling network gateway. The call negotiator operating answer network gateway sends AA 298 to the answer 
under the packet data modem exchange service on the modem which initiates normal range tone exchange with the 
calling network gateway assumes operation in accordance answer modem. The answer network gateway forwards AC 
with the V.22bis standard and terminates the call negotiator. 60 300 to calling network gateway which in turn relays this 
The packet data modem exchange service, operating on the information 302 to the calling modem to initiate normal 
answer network gateway, invokes operation in accordance range tone exchange with the calling modem, 
with the V.22bis standard after an answer tone timeout The answer modem sends its first training sequence 304 
period and terminates the call negotiator 200. followed by Rl to the rate negotiator operating in the answer 

V.22bis handshaking does not utilize rate messages or 65 network gateway. When the answer network gateway 
signaling to indicate the selected bit rate as with most high receives Rl, it forwards Rl 306 to the calling network 
data rate pumps. Rather, the inclusion of a fixed duration gateway via the packetization engine operating in the answer 
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network gateway. The answer network gateway repetitively modem independently negotiate with the calling network 

sends training sequences to the answer modem, until receiv- gateway and the answer network gateway respectively. The 

ing an R2 indication 308 from the calling modem, and the calling network gateway and the answer network gateway 

training result of the calling network gateway (formatted as exchange training results in the form of MP sequences when 

a rate signal). The calling network gateway forwards the Rl 5 Phase IV of the independent negotiations is reached, 

indication 310 of the answer modem to the calling modem. However, ^the calling network gateway and the answer 

The calling modem sends training sequences to calling network gateway are prevented from relaying MP sequences 

network gateway 312. The calling network gateway deter- to the calling modem and the answer modem respectively 

mines the data rate capability of the calling modem, and until the training results for both network gateways and the 

forwards this training result to the answer network gateway io MP sequences for both modems are available. If symmetric 

in a data rate signal format. The calling modem sends R2 rate is enforced, the maximum answer data rate and the 

308 to the calling network gateway which forwards it to the maximum call data rate of the four MP sequences are 

answer network gateway. The calling network gateway compared. The lower data rate of the two maximum rates is 

sends training sequences to the calling modem until receiv- the preferred data rate. Each network gateway sends the MP 

ing an R3 signal 314 from the answer modem via the answer 15 sequence with the preferred rate to it's respective modem so 

network gateway. that the calling and answer modems operate at the preferred 

The answer network gateway performs a logical AND data rate, 

operation on the Rl signal from the answer modem, the R2 If asymmetric rates are supported, then the preferred 

signal from the calling modem and the training sequences of call-answer data rate is the lesser of the two highest call- 

the calling network gateway to create a second rate signal R2 2D answer rates of the four MP sequences. Similarly, the 

316, which is forwarded to the answer modem. The answer preferred answer-call data rate is the lesser of the two 

modem sends its second training sequence followed by R3. highest answer-call rates of the four MP sequences. Data rate 

The answer network gateway relays R3 314 to the calling capabilities may also need to be modified when the MP 

network gateway which forwards it to the calling modem sequence are formed so as to be sent to the calling and 

and begins operating at the R3 specified bit rate. However, 25 answer modems. The MP sequence sent to the calling and 

this method of rate synchronization is not preferred for answer modems, is the logical AND of the data rate capa- 

V 32b is due to time constrained handshaking. bilities from the four MP sequences. 

c. V.34 Rate Synchronization d. V.90 Rate Synchronization 

Data transmission in accordance with the V.34 standard The V.90 standard utilizes a digital and analog modem 

utilizes a modulation parameter (MP) sequence to exchange 30 pair to transmit modem data over the PSTN line. The V.90 

information pertaining to data rate capability. The MP standard utilizes MP sequences to convey training results 

sequences can be exchanged end to end to achieve data rate from a digital to an analog modem, and a similar sequence, 

synchronization. Initially, the call negotiator operating under using constellation parameters (CP) to convey training 

the packet data modem exchange in the answer network results from an analog to a digital modem. Under the V90 

gateway relays the answer tone (ANSam) from the answer 35 standard, the timeout period is 15 seconds compared to a 

modem to the calling modem. When the calling modem timeout period of 30 seconds under the V.34 standard. In 

receives answer tone, it generates a CM indication. When addition, the analog modems control the handshake timing 

the calling network gateway receives a CM indication, it during training. In an exemplary embodiment, the calling 

forwards it to the answer network gateway which then modem and the answer modem are the V.90 analog modems, 

communicates the CM indication with the answer modem. 40 As such the calling modem and the answer modem are 

The answer modem then responds with JM, which is relayed beyond the control of the network gateways during training, 

to the calling modem via the calling network gateway. If the The digital modems control the timing during transmission 

calling network gateway then receives CJ, the call negotiator of TRNld. The digital modem uses TRNld to train its echo 

operating under the packet data modem exchange, on the canceller. 

calling network gateway, initiates operation in accordance 45 When operating in accordance with the V.90 standard, the 

with the V.34 standard, and forwards a CJ indication to the call negotiator utilizes the V.8 recommendations for initial 

answer network gateway. If the JM menu calls for V.34, the negotiation. Thus, the invocation of the V.90 relay session is 

call negotiator operating under the packet data modem the same as that described for the V.34 standard. There are 

exchange on the answer network gateway initiates operation two configurations where V.90 relay may be used. The first 

in accordance with the V.34 standard and the call negotiator 50 configuration is data relay between two V.90 analog 

is terminated. If a standard other than V.34 is called for, the modems, i.e. the two network gateways are both configured 

appropriate procedure is invoked, such as those described as V.90 digital modems. The upstream rate according to the 

previously for V.22 or V.32bis. V.90 standard is limited to 33,600 bps. Thus, the maximum 

After a V34 relay connection is established, the calling data rate for an analog to analog relay is 33,600 bps. The 

modem and the answer modem freely negotiate a data rate 55 minimum data rate for a V.90 digital gateway will support is 

at each end of the network with the packet data modem 28,800 bps. Therefore, the connection must be terminated if 

exchange service operating on their respective network the maximum data rate for one or both of the upstream 

gateways. Each network gateway forwards a connection rate directions is less than 28,800 bps, and one or both the 

indication to the other gateway. Each gateway compares the downstream direction is in V90 digital mode. Therefore, the 

far end bit rate to the rate transmitted by each gateway. The 60 V.34 relay is preferred over V.90 analog to analog data relay, 

preferred rate is the minimum of the two rates. Rate A second configuration is a connection between a V.90 

re-negotiation is invoked if the connection rate at the calling analog modem and a V.90 digital modem. A typical example 

or receiving end differs from the preferred rate, to force the of such a configuration is when a user within a packet based 

connection to the desired rate. PABX system dials out into a remote access server (RAS) or 

In an alternate method for V.34 rate synchronization MP 65 an Internet service provider (ISP) that uses a centra] site 
sequences are utilized to achieve rate synchronization with- modem for physical access that is V.90 capable. The con- 
out rate re -negotiation. The calling modem and the answer nectioo from PABX to the central site modem may be either 
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through PSTN or directly through an ISDN, Tl or El the information frame affects the overall memory required to 

interface. Thus the V.90 embodiment should support an implement the protocol, a information frame length larger 

analog modem interfacing directly to ISDN, Tl or El. than 260 octets is unlikely. 

For analog to digital modem connection, the connections T° e spoofing logic 214 stores a threshold water mark 

at both ends of the packet based network should be either 5 ( with a value set to b . e approximately equal to the maximum 

digital or analog to achieve proper rate synchronization. The len Sth of an HDLC information frame). The spoofing logic 

analog modem decides whether to select digital mode as 214 searches for HDLC flags (0111110 bit sequence) within 

specified in INFOla, so that INFOl a should be relayed the jitter buffer 208 when the amount of data signal stored 

from end to end before operation mode can be synchronized. ^! hm 4 ? e ^'^u 2 °? ^ bc *T ^ sh ? ld 1 ™? A 

The relay sequence for achieving mode alignment is as 10 ™ hcn * c t0 ^ SP °K°^ g ^ ^ 

follows 6 & begins to insert HDLC flags into the jitter buffer 208, and 

™ " . . . . tVTrrt , . continues until the amount of data signal within the jitter 

The calling network gateway receives an INFOla signal buffef 20g ^ (han ^ * ^ 

from the calkng modem. The calling network gateway sends 3 Rctrain and Ratc RcnC gotiation 

a mode indication to the answer network gateway indicating When a relrain 0CClirs> an indication should be forwarded 

whether digital or analog will be used. Operation then begins is to the ne twork gateway at the end of the packet based 

in the mode specified in INFOla. The answer modem sends network. The network gateway receiving a retrain indication 

a signal to the answer network gateway. The answer network should initiate retrain with the connected modem to keep 

gateway performs line probe processing on this signal to data flow in synchronism between the two connections. Rate 

determine whether digital mode can be used. Upon receipt synchronization procedures as previously described should 

of the mode indication signal from the calling network 20 be used to maintain data rate alignment after retrains. 

gateway, the answer network gateway sends an INFOla Similarly, rate renegotiation causes both the calling and 

sequence to the answer modem. If analog mode is indicated, answer network gateways and to perform rate renegotiation. 

the answer network gateway proceeds with analog mode However, rate signals or MP (CP) sequences should be 

operation. If digital mode is indicated and digital mode can exchanged per method two of the data rate alignment as 

be supported by the answer modem, the answer network 25 previously discussed for a V.32bis or V.34 rate synchroni- 

gateway sends an INFOla sequence to the answer modem zation whichever is appropriate. 

indicating that digital mode is desired and proceeds with 4. Error Correcting Mode Synchronization 

digital mode operation. Error control (V.42) and data compression (V.42bis) 

Alternatively, if digital mode is indicated and digital mode modes should be synchronized at each end of the packet 

can not be supported by the answer modem, the calling 30 based network by one of two alternate methods. In the first 

modem must be forced into analog mode by one of three method, the calling modem and the answer modem inde- 

alternate methods. First, some commercially available V.90 pendently negotiate modes on their own, transparent to the 

analog modems may revert to analog mode after several modem network gateways. This method is preferred for 

retrains. Thus, one solution is to force retrains until the connections wherein the network delay plus jitter is rela- 

calling modem selects analog mode operation. In an alter- 35 tively small, as characterized by an overall round trip delay 

nate method, the call network gateway modifies its line of less than 700 msec. 

probe so as to force calling modem 180 to select analog In an alternate method, the error control synchronizers 
mode. In a third method, the calling modem and the answer 222 operating with the network gateways force the user 
modem operate in different modes. Under this method if the modems out of LAPM mode into a non-error correcting 
answer modem can not support a 28,800 bps data rate the 40 protocol (V.14). Preferably, the error correction synchro- 
connection is terminated. nizer 222 operating under the packet data modem exchange 
2. Data Mode Spoofing 54 in the calling network gateway waits a period of time 
The jitter buffer 208 may underflow during long packet (about 650 msec.) for an error correction mode indication 
delivery delay. The jitter buffer 208 underflow can cause the from the opposite end of the network. If an indication 
data pump transmitter 210 to run out of data, so that the jitter 45 arrives, then the first method is used. If not, the error 
buffer 208 must be spoofed with bit sequences. Preferably correction synchronizer 222 operating under the packet data 
the bit sequences are benign in most applications. While modem exchange in the calling network gateway responds 
transmitting start-stop characters in accordance with V.14 with an ADP followed by HDLC flags. The HDLC flags 
recommendations, the spoofing logic 214 checks for char- spoof the calling modem until the an error correction mode 
acter format and boundary (number of data bits, start bits 50 indication arrives. If mode indication is received before 
and stop bits) within the jitter buffer 208. The spoofing logic timeout, which indicates error control mode, then unnum- 
214 must account for stop bits omitted due to asynchronous- bered acknowledgment (UA) response is sent to the calling 
to-synchronous conversion. Once the spoofing logic 214 modem and the calling network gateway proceeds with an 
locates character boundary, ones can be added to spoof the error control connection. 

remote modem and keep it in the mark state. The length of 55 The V.42 recommendation does not specify the length of 

time a modem can be spoofed with ones depends only upon time HDLC flags will be accepted before the calling modem 

the application program driving the user modem. timeouts. Therefore, empirical tests should be performed to 

While in error correction mode the spoofing logic 214 determine how long the calling modem within a particular 

checks for HDLC flag (HDLC frame boundary) within the implementation can be spoofed in this manner, 

jitter buffer 208. The jitter buffer 208 should be sufficiently 60 Alternatively, if the calling network gateway receives 

large to guarantee that at least one complete HDLC frame is mode indication indicating V.14 or a timeout has occurred, 

contained within the jitter buffer 208. The default length of the calling network gateway issues a disconnect mode (DM) 

an HDLC information frame is 132 octets. The V.42 rec- response to indicate exit from V.42. The calling modem 

ommendations for error correction of data circuit terminat- should then revert to non-error control mode, 

ing equipment (DCE) using asynchronous- to-synchronous 65 Data compression mode is negotiated within V.42 so that 

conversion does not specify a maximum length for an the appropriate mode indication can be relayed when the 

HDLC information frame. However, because the length of calling and answer modems have entered into V.42 mode. 
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A third mode is to allow modems at both ends to freely 
negotiate the error control mode with their respective net- 
work gateways. The network gateways must fully support all 
error correction modes when using this method. Also, 
because of flow control issues, this method cannot support 5 
the scenario where one modem selects V.14 while the other 
modem selects a mode other than V.14. For the case where 
V.14 is negotiated at both sides of the packet based network, 
the 8-bit no parity format is assumed and the raw demodu- 
lated data bits are transported between the network gate- 1Q 
ways. With all other cases, each gateway shall extract the 
de-framed (error corrected) data bits and forwards them to 
its counterpart at the opposite end of the network. Flow 
control procedures within the error control protocol can be 
used to handle network delay. The advantage of this method 
over the first method is its ability to handle large network 15 
delays and also the scenario where the local connection rates 
at the network gateways are different. However, packets 
transported over the network in accordance with this method 
must be guaranteed to be error free. 

Although a preferred embodiment of the present invention 20 
has been described, it should not be construed to limit the 
scope of the appended claims. For example, the present 
invention can be implemented by both a software embodi- 
ment or a hardware embodiment. Those skilled in the art will 
understand that various modifications may be made to the 25 
described embodiment. Moreover, to those skilled in the 
various arts, the invention itself herein will suggest solutions 
to other tasks and adaptations for other applications. It is 
therefore desired that the present embodiments be consid- 
ered in all respects as illustrative and not restrictive, refer- 30 
ence being made to the appended claims rather than the 
foregoing description to indicate the scope of the invention. 

What is claimed is: 

1. A method of transmitting data, comprising: 
negotiating a first data rate between a first rate negotiator 35 

and a source telephony device; 

negotiating a second data rate between a second rate 
negotiator and a destination telephony device; and 

renegotiating the first negotiated data rate and the second 
negotiated data rate to allow data transmission between 40 
the source and destination telephony devices, if the first 
negotiated data rate and the second negotiated data rate 
are different. 

2. The method of claim 1 wherein the source and desti- 
nation telephony devices each comprises a modem. 45 

3. The method of claim 1 wherein the first data rate 
negotiation comprises setting the first negotiated data rate 
based on the source telephony device data rate and the first 
rate negotiator data rate. 

4. The method of claim 3 wherein the negotiated data rate 50 
is set to the lower of the source telephony device data rate 
and the first rate negotiator data rate. 

5. The method of claim 1 wherein the data rate renego- 
tiation comprises setting the renegotiated data rate based on 

a system data rate and the first negotiated data rate. 55 

6. The method of claim 5 wherein the system further 
comprises a data exchange, the method further comprising 
setting a system data rate based on the negotiated destination 
telephony device data rate. 

7. The method of claim 5 wherein the renegotiated data 60 
rate is set to the lower of the first negotiated data rate and the 
second negotiated data rate. 

8. The method of claim 5 wherein the data rate renego- 
tiation is performed over a packet based network. 

9. The method of claim 8 wherein the data rate renego- 65 
tiation further comprises inhibiting receipt of data packets 
from the packet based network. 



10. The method of claim 5 wherein the data rate renego- 
tiation further comprises resetting the source telephony 
device with the renegotiated data rate. 

11. The method of claim 10 wherein the source telephony 
device is reset by retraining the source telephony device 
with the renegotiated data rate. 

12. A method of synchronizing a data rate, comprising: 
initializing a data rate; 

receiving a data rate from a source telephony device; 
setting a first negotiated data rate based on the initial date 

rate and the data rate for the source telephony device; 
receiving a second data rate from a destination telephony 

device; 

setting a second negotiated data rate based on the initial 
date rate and the data rate for the destination telephony 
device; and 

setting a renegotiated data rate based on the first negoti- 
ated data rate and the second negotiated data rate. 

13. The method of claim 12 wherein the first negotiated 
data rate is set to the lower of the initial data rate and the data 
rate for the source telephony device. 

14. The method of claim 12 wherein the setting of the first 
negotiated data rate comprises setting the data rate for the 
source telephony device to the first negotiated data rate. 

15. The method of claim 14 wherein the setting of the 
renegotiated data rate comprises resetting the set data rate 
for the source telephony device to the renegotiated data rate. 

16. The method of claim 12 wherein the renegotiated data 
rate is set to the lower of the first negotiated data rate and the 
second negotiated data rate. 

17. The method of claim 12 wherein the system comprises 
a data exchange, and wherein the receiving of a second data 
rate comprises negotiating a system data rate between the 
destination telephony device and the data exchange. 

18. The method of claim 17 wherein the system data rate 
negotiation comprises setting a data rate for the destination 
telephony device to the system data rate. 

19. The method of claim 18 wherein the setting of the 
renegotiated data rate comprises resetting the set data rate 
for the destination telephony device to the renegotiated data 
rate. 

20. The method of claim 17 wherein the destination 
telephony device comprises a modem. 

21. The method of claim 12 wherein the source telephony 
device comprises a modem, 

22. The method of claim 12 wherein the renegotiation of 
the data rate is over a packet based network. 

23. The method of claim 22 wherein the data rate rene- 
gotiation comprising inhibiting receipt of data packets. 

24. A method of synchronizing a data rate, comprising: 
exchanging data rates between a first data exchange and 

a first telephony device; 
negotiating a first data rate based on the exchanged data 

rates between the first data exchange and the first 

telephony device; 
exchanging data rates between a second data exchange 

and a second telephony device; 
negotiating a second data rate based on the exchanged 

rates between the second data exchange and the second 

telephony device; 
exchanging the first and the second data rates over a 

packet based network; and 
negotiating a third data rate based on the exchanged first 

and second data rates. 

25. The method of claim 24 further comprising resetting 
the first and the second telephony devices with the third data 
rate. 
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26. The method of claim 25 wherein each of the first and a second data exchange having a data rate; 

second telephony devices are reset by retraining each of the a SCCO nd rate negotiator which exchanges the data rates 

first and second telephony devices with the third data rate. between the second data exchange and the second 

27. The method of claim 24 wherein the first and the telephony device and negotiates a second data rate 
second telephony devices each comprises a modem. 5 based on the exchanged data rates between the second 

28. The method of claim 24 wherein the first data rate data exchange and the second telephony device, 
negotiation comprises setting the first data rate to the lower wherein the first and the second rate negotiators coop- 
of the exchanged data rates between the first data exchange era t e to exchange the first and the second data rates and 
and the first telephony device. negotiate a third data rate based on the exchanged first 

29. The method of claim 24 wherein the second data rate id anc j second data rates* and 

negotiation comprises setting the second data rate to the a ket based netWQrk J H ^ first data exchange to 

lower of the exchanged data rates between the second data ^ secQnd ^ exch 

exchange and the second telephony device. 42 ^ si ^ transmission tcm of daim 41 whercin 

30. T^e method of claim 24 wherein the third data rate each of ^ firgt and me secQnd fate Uators sets me daU 
negotiation comprises setting the tod data rate to the lower is rate of {t& ctive Aphony devicelo the third data rate, 
of the exchanged first and second data rates. 43 ^ ^ transmission tem of claim 41 wherein 

31. The method of claim 24 wherein the third data rate ^ fifSt and ^ secQnd h devices each rises a 
negotiation comprises inhibiting receipt of data packets at modem 

each of the first and second data exchanges. 44 ^ ^ transmission tem of claim 41 wherein 

32. A data exchange comprising: 20 me first data rate comprises me lower of lhe exchange d data 
a first rate negotiator capable of negotiating a first data rates between the first data exchange and the first telephony 

rate with a source telephony device; and device, 

a second rate negotiator capable of negotiating a second 45. The signal transmission system of claim 41 wherein 

data rate with a destination-telephony device, wherein the second data rate comprises the lower of the exchanged 

the first rate negotiator and the second rate negotiator 25 data rates between the second data exchange and the second 

renegotiate the negotiated first and second data rates to telephony device. 

allow data transmission between the first and second 46. The signal transmission system of claim 41 wherein 

telephony devices, if the first negotiated data rate and the third data rate comprises the lower of the first and the 

the second negotiated data rate are different. second data rates. 

33. The data exchange of claim 32 wherein the first rate 30 47. The signal transmission system of claim 41 wherein 
negotiator initializes an initial data rate, the negotiated first each of the first and the second rate negotiators inhibit 
data rate being based an initial data rate of the source reception of data packets from the packet based network 
telephony device and the initial data rate of the rate nego- during the third data rate negotiation. 

tiator. 48. The signal transmission system of claim 41 wherein 

34. The data exchange of claim 32 wherein the first 35 each of the first and the second data exchanges comprises a 
negotiated data rate comprises the lower of the initial source data pump which exchanges data signals between the packet 
telephony device data rate and the first initial rate negotiator based network and its respective telephony device. 

data rate. 49. Computer-readable media embodying a program of 

35. The data exchange of claim 32 wherein the renego- instructions executable by a computer to perform a method 
dated data is based on a data rate for the system and the first 40 of transmitting data, the method comprising: 
negotiated data rate. negotiating a first data rate between a first rate negotiator 

36. The data exchange of claim 35 wherein the renego- anc j a source telephony device; 

tiated data rate comprises the lower of the first negotiated nego tiating a second data rate between a second rate 

data rate and the second negotiated data rate. negotiator and a destination telephony device; and 

37. The data exchange of claim 35 wherein the first rate . . , „ 

negotiator is further capable of resetting the source tele- renegotiatuig the first negotaated data rate and the second 

phony device with the renegotiated data rate. negotiated data rate to allow data transm>ssion between 

38. The data exchange of claim 37 wherein the first rate ^ s™™*?* d^naUon telephony devices, if the first 
negotiator rests the source telephony device by retraining the negotiated data rate and the second negotiated data rate 
source telephony device with the renegotiated data rate. 50 mt ^Z eren ' , , , i 

39. The data exchange of claim 32 further comprising a 50. The computer-readable media of claim 49 wherein the 

uiru-j.-iu. source and destination telephony devices each comprises a 

data pump capable of exchanging data signals between a r j r 

circuit switched network and a packet based network at the mo ... em ^ ... ,. , . . .„ . ... 

renegotiated data rate 51 ^ computer-readable media of claim 49 wherein the 

40. The data exchange of claim 39 wherein the secondrate 55 firs ' ^ rate se » m S ' hc firs < 
negotiator inhibits receipt of the data signals from the packet ated ^ rate baSed ° n th ° T* telephony deVlce data rate 
based network during data rate renegotiation. and me first rate negotiator data rate 

41. A signal transmission system, comprising: 52. The computer-readable media of chum 51 wherein the 

. t . . . . , negotiated data rate is set to the lower of the source 

a first telephony device having a data rate; 6Q tel * phony device data ra , e ^ me first rate negotiator data 

a first data exchange having a data rate; rate 

a first rate negotiator which exchanges the data rates 53. The computer-readable media of claim 49 wherein the 

between the first data exchange and the first telephony data rate renegotiation comprises setting the renegotiated 

device and negotiates a first data rate based on the data rate based on a system data rate and the first negotiated 

exchanged data rates between the first data exchange 55 data rate. 

and the first telephony device; 54. The computer-readable media of claim 53 wherein the 

a second telephony device having a data rate; system further comprises a data exchange, the method 
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further comprising setting a system data rate based on the 
negotiated destination telephony device data rate. 

55. The computer-readable media of claim 53 wherein the 
renegotiated data rate is set to the lower of the first negoti- 
ated data rate and the second negotiated data rate. 

56. The computer-readable media of claim 53 wherein the 
data rate renegotiation is performed over a packet based 
network. 

57. The computer-readable media of claim 56 wherein the 
data rate renegotiation further comprises inhibiting receipt 
of data packets from the packet based network. 

58. The computer-readable media of claim 53 wherein the 
data rate renegotiation further comprises resetting the source 
telephony device with the renegotiated data rate. 

59. The computer-readable media of claim 58 wherein 
each of the source and destination telephony devices are 
reset by retraining each of the source and destination tele- 
phony devices with the third data rate. 

60. Computer-readable media embodying a program of 
instructions executable by a computer to perform a method 
of synchronizing a data rate, the method comprising: 

initializing a data rate; 

receiving a data rate from a source telephony device; 
setting a first negotiated data rate based on the initial date 

rate and the data rate for the source telephony device; 
receiving a second data rate from a destination telephony 

device; 

setting a second negotiated data rate based on the initial 
date rate and the data rate for the destination telephony 
device; and 

setting a renegotiated data rate based on the first negoti- 
ated data rate and the second negotiated data rate. 

61. The computer-readable media of claim 60 wherein the 
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67. The computer- readable media of claim 66 wherein the 
setting of the renegotiated data rate comprises resetting the 
set data rate for the destination telephony device to the 
renegotiated data rate, 

68. The computer-readable media of claim 65 wherein the 
destination telephony device comprises a modem. 

69. The computer-readable media of claim 60 wherein the 
source telephony device comprises a modem. 

70. The computer-readable media of claim 60 wherein the 
renegotiation of the data rate is over a packet based network. 

71. The computer- readable media of claim 70 wherein the 
data rate renegotiation comprising inhibiting receipt of data 
packets. 

72. A data exchange for a transmission system compris- 
ing: 

first negotiating means for negotiating a first data rate 
with a source telephony device; 

second negotiating means for negotiating a second data 
rate with a destination telephony device, wherein the 
first rate negotiator and the second rate negotiator 
renegotiate; and 

renegotiating means for renegotiating the negotiated first 
and second data rates to allow data transmission 
between the first and second telephony devices, if the 
first negotiated data rate and the second negotiated data 
rate are different. 

73. The data exchange of claim 72 wherein the negotia- 
tion means initializes an initial data rate, the negotiated data 
rate being based an initial data rate of the source telephony 
device and the initial data rate of the data exchange. 

74. The data exchange of claim 73 wherein the negotiated 
data rate comprises the lower of the initial source telephony 



first negotiated data rate is set to the lower of the initial data 35 device data rate and the initial data exchange data rate. 



rate and the data rate for the source telephony device. 

62. The computer-readable media of claim 60 wherein the 
setting of the first negotiated data rate comprises setting the 
data rate for the source telephony device to the first nego- 
tiated data rate. 

63. The computer-readable media of claim 62 wherein the 
setting of the renegotiated data rate comprises resetting the 
set data rate for the source telephony device to the renego- 
tiated data rate. 

64. The computer-readable media of claim 60 wherein the 
renegotiated data rate is set to the lower of the first negoti- 
ated data rate and the second negotiated data rate. 

65. The computer-readable media of claim 60 wherein the 
system comprises a second telephony device and a data 
exchange, and wherein the receiving of a second data rate 
comprises negotiating the system data rate between the 
second telephony device and the data exchange. 

66. The computer-readable media of claim 65 wherein the 
system data rate negotiation comprises setting a data rate for 
the destination telephony device to the second negotiated 
data rate. 
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75. The data exchange of claim 72 wherein the renego- 
tiated data is based on a data rate for the transmission system 
and the first negotiated data rate. 

76. The data exchange of claim 75 wherein the renego- 
tiated data rate comprises the lower of the first negotiated 
data rate and the second negotiated data rate. 

77. The data exchange of claim 72 further comprising 
reset means for resetting the source telephony device with 
the renegotiated data rate. 

78. The data exchange of claim 77 wherein the reset 
means comprises means for retraining the source telephony 
device with the renegotiated data rate. 

79. The data exchange of claim 72 further comprising 
means for exchanging data signals between a circuit 
switched network and a packet based network at the rene- 
gotiated data rate. 

80. The rate negotiator of claim 79 further comprising 
means for inhibiting receipt of the data signals from the 
packet based network during data rate renegotiation. 
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